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I.  INTRODUCTION 

Tnis  report  describes  the  work  accomplished  under  grant  AFOSR-77- 
3219  from  the  Air  Force  Office  of  Scientific  Research  during  the  period 
1  February  1978  tnrough  31  January  1979. 

Tne  work  is  divided  into  two  separate  and  distinct  parts.  Section 
IT,  written  by  Alan  Huang,  is  concerned  with  a  variety  of  architectural 
issues  in  the  design  of  any  computer  based  on  residue  arithmetic.  Tne 
results  are  directly  applicable  to  optical  approaches  to  residue  comput¬ 
ing,  but  are  equally  important  in  considering  the  design  of  electronic 
versions  of  this  type  of  computer. 

Section  III,  written  by  Jon  Mandeville,  considers  the  problem  of 
error  detection  and  correction  in  residue  computers.  As  interest  grows 
in  the  use  of  residue  techniques  in  what  are  basically  analog  systems, 
the  consequences  of  errors  in  the  representation  of  residue  numbers  by 
analog  values  must  be  considered.  Tne  issue  is  a  complex  one,  for  often 
the  probability  of  error  will  increase  as  the  size  of  a  given  modulus 
increases,  so  tne  choice  of  moduli  will  affect  the  reliability  of  the 
system.  Sometimes  extra  moduli  can  be  used  for  the  sole  purpose  of  pro¬ 
tecting  the  integrity  of  tne  data.  These  issues  are  explored  in  detail 
in  section  III. 


Our  earlier  report  [Ref.  1]  is  essential  background  reading  before 
begining  this  report.  A  sightly  condensed  version  of  the  earlier  report 
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II.  ARCHITECTURAL  STUDIES 


A.  Background 

The  combination  of  residue  arithmetic  and  optics  was  prompted  by 
the  realization  that  the  cyclic  nature  of  the  residue  number  system  can 
be  mimicked  by  various  physical  phenomena.  Our  initial  investigation 
resulted  in  a  genealogy  of  possible  approaches  as  shown  in  Table  l.  In 
an  effort  to  evaluate  the  relative  advantages  of  the  various  approaches 
we  studied  the  physical  switching  mechanisms  ultilized  by  each  approach. 
Our  findings  are  summarized  in  Table  2.  Speed,  possibility  of  integra¬ 
tion,  and  cost  were  used  as  some  of  the  criteria.  These  results  were 
then  incorporated  into  an  overall  evaluation  of  the  basic  approaches,  as 
summarized  in  Table  3  *• 


The  approaches  listed  in  the  preceeding  table  involve  a  direct  mim¬ 
icking  of  the  residue  system.  Some  mathematical  insights  have  enabled  us 
to  relax  some  of  the  restraints  on  these  technologies.  One  simplifica¬ 
tion  was  the  realization  that  any  cyclic  shift  can  be  synthesized  from  a 
binary  decompostion  of  shifts.  As  an  example,  a  cyclic  shift  of  7  is 
equal  to  a  shift  of  4,  a  shift  of  2,  and  a  shift  of  1.  This  result 
reduced  the  type  and  number  of  shifts  needed. 

Another  important  finding  was  that  the  non  cyclic  maps  needed  for 
multiplication  can  be  generated  with  a  fixed  pre-permutation,  cyclic 
shifts,  and  a  post-permutation  [  Ref.  1,  p.  36  ].  This  allows  technol¬ 
ogy  that  was  previously  only  capable  of  performing  cyclic  shifts  to  also 


*  Tables  1,  2,  and  3  prepared  by  Dr.  Yoshito  Tsunoda  of  Hitachi 
Ltd.  during  his  stay  at  Stanford  from  1976  thru  1977. 
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Rotation  Image 


perform  multiplication.  These  technologies  could  then  perform  decoding 
to  mixed  radix  as  well  as  other  more  complex  computations. 

These  findings  formed  a  foundation  for  a  second  generation  of 
implementations.  Our  efforts  had  been  directed  at  mimicking  cyclic 
shifts  with  physical  cyclic  shifts.  Such  shifts  can  also  be  accom¬ 
plished  by  several  indirect  means. 

The  first  such  approach  was  the  off  diagonal  switching  device  [ 
Ref.  1,  p.  34  ].  Mathematically,  this  approach  relies  on  the  fact  that 
the  partitioning  of  an  ordered  set,  the  reversal  of  the  elements  in  each 
of  the  two  subsets,  and  the  reconcatenation  of  the  two  sets  results  in  a 
cyclic  shift  in  reverse  order.  As  an  example  {0, 1 ,2, 3, 4, 5, 6}  would  be 
partition  into  {0, 1,2,3}  and  {4,5,6}.  Each  of  these  subsets  would  then 
be  reversed.  The  result  is  (3,2, 1,0}  and  {6,5,4}.  Recombining  these  two 
subsets  would  result  in  {3,2, 1,0, 6, 5, 4}  which  can  be  seen  to  be  the 
reverse  of  {4 ,5,6,0, 1 ,2, 3}  which  is  a  cyclic  shift  of  3.  The  primative 
operations  in  this  type  of  cyclic  shifter  are  a  partition  ,  reversal, 
and  concatenation.  This  finding  extends  the  types  of  technologies  that 
are  capable  of  performing  cyclic  shifts.  A  simple  conceptual  example  is 
shown  in  Fig.  1(a),  in  which  two  lenses  are  used  to  perform  a  cyclic 
spatial  3hift. 

Another  second  generation  mapping  device  is  the  "amida  kuzi" 
approach  [  Ref.  1,  p.  41  ].  The  mathematical  basis  of  this  approach  is 
that  any  permutation  (1-to-1  mapping)  can  be  accomplished  with  only 
interchanges  between  neighboring  elements.  As  an  example,  a  cyclic  shift 
of  3  of  the  set  {0, 1 ,2, 3,4, 5, 6}  can  be  accomplished  by  modifying  the 
order  with  neighboring  interchanges  to  (0.1. 2. 4.3.5. 6)  to 


OVERALL  EVALUATION  OF  VARIOUS  IMPLEMENTATIONS 


{0,1,JL£,5*I,6}  to  {0,44_L,5_fc£,&a}  to  3 }  to  {H,5J).,fu±,2, 3) 
to  finally  {4,5,6.0.1,2,31,  which  is  a  cyclic  shift  of  3.  The  primative 
operation  of  this  type  of  cyclic  shifter  is  a  pairwise  interchange.  This 
finding  also  extends  the  types  of  technologies  that  are  capable  of  per¬ 
forming  cyclic  shifts.  A  simple  conceptual  example  is  shown  in  Fig. 
t(b),  where  couplets  of  lenses  are  used  to  perform  interchanges  for  a 
cyclic  permutation. 

These  second  generation  approaches  to  performing  permutations 
extend  the  range  of  technologies  to  be  considered.  This  varity  makes  it 
more  difficult  to  focus  on  any  "best"  approach. 

In  an  effort  to  gain  a  better  perspective,  this  year's  efforts  were 
devoted  to  examining  both  the  potiental  and  limitiations  of  using  vari¬ 
ous  technologies.  Phrased  in  another  way  "given  a  certain  technology 
what  can  and  cannot  be  done  with  it?" 


B.  Given  A  Certain  Technology  Mhafc.  £an  tt£  Eft  &lth  It? 

The  potential  of  such  systems  is  dependent  on  the  types  of  problems 
that  it  can  solve  and  the  throughput  (  data  samples/sec  )  with  which  it 
can  solve  these  problems. 

Some  of  the  problems  that  seem  attractive  for  a  residue  approach 
are  inner  products,  summation,  and  determinate  evaluation.  They  are 
representative  of  many  of  the  problems  in  signal  processing.  They  also 
provide  a  convienent  benchmark  for  comparisons  with  more  conventional 
computational  approaches. 

A  residue  approach  is  capable  of  very  large  throughputs.  There  are 
two  fundamental  ways  of  achieving  this  goal.  One  appoach  relies  on 
speed  while  the  other  relies  on  parallelism.  These  two  architectural 
strategies  favor  different  technological  traits.  The  overall  goal  is  to 
relate  the  characteristics  of  a  given  technology  to  its  performance  on  a 
given  problem  using  a  given  architectural  approach.  This  will  help  to 
define  the  appropriateness  of  a  given  technology  and  also  to  quantify 
the  relative  merits  of  the  a  residue  and  conventional  computational 
approaches . 


i 
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C.  Modular  Processors  Pipelined  £y.  Moduli  (  The  Rabbit  ) 

A  modular  processor  pipelined  by  moduli  is  shown  in  figure  2.  This 
architectural  approach  relies  on  ultilizing  speed  to  achieve  high 
throughputs.  We  refer  to  this  approach  as  the  "rabbit"  approach  or  as  a 
cascaded  number  theoretic  processor.  The  processor  works  in  much  the 
same  way  as  an  assembly  line.  The  coefficients  of  a  desired  calculation 
are  placed  on  the  data  bus  at  the  left.  The  MOD  X  processor  takes  these 
coefficients  and  produces  the  modulus  X  equivalent  of  the  answer  for  the 
desired  calculation.  This  answer  is  also  one  of  the  mixed  radix  coeffi¬ 
cients  of  the  answer;  it  is  passed,  along  with  the  coefficients  of  the 
calculation,  by  the  data  bus  to  the  MOD  Y  processor.  This  processor 
uses  the  coefficients  along  with  the  previously  determined  mixed  radix 
coefficients  to  compute  another  mixed  radix  coefficient.  Meanwhile  the 
MOD  X  processor  is  processing  the  coefficients  of  another  calculation. 
This  process  continues.  Each  modular  processor  uses  the  coefficients 
along  with  all  the  previously  determined  mixed  radix  coefficients  to 
produce  another  mixed  radix  coefficient.  What  emerges  from  the  data  bus 
is  a  mixed  radix  version  of  the  answer  to  the  desired  calculation.  The 
throughput  rate  of  this  pipelined  processor  is  the  reciprocal  of  the 
maximum  modular  processor  time.  The  last  processor  usually  takes  the 
longest  since  it  has  to  consider  the  results  of  all  the  previous  proces¬ 
sors.  The  latency,  the  time  it  takes  for  a  given  calculation  to  complete 
this  assembly  line,  is  the  product  of  the  number  of  processors  and  the 
maximum  processor  time.  The  accuracy  or  range  of  such  a  processor 
depends  only  on  the  product  of  the  moduli  used.  If  a  processor  is 
designed  with  many  small  moduli  rather  than  a  few  large  moduli  then  the 
latency  will  be  larger  because  there  would  be  more  modular  processors. 
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The  throughput  rate  of  this  process  is  independent  of  the  accuracy.  This 
is  different  from  conventional  computational  approaches. 

The  throughput  of  this  type  of  system  is  dependent  on  the  maximum 
modular  processor  time.  For  analytical  purposes  it  was  useful  to  estab¬ 
lish  some  basic  building  blocks  for  modular  processors.  By  studying  the 
behavior  of  these  simpler  structures  the  behavior  of  the  modular  proces¬ 
sors  can  be  synthesized  and  the  overall  performance  of  such  systems 
predicted.  The  basic  building  blocks  are  shown  in  Fig.  3.  The  opera¬ 
tion  of  each  of  the  building  blocks  for  a  given  technology  can  be 
characterized  by  two  parameters.  One  is  the  set  time;  this  is  how  long 
the  unit  needs  to  get  ready.  The  other  parameter  is  the  propagation 
time;  this  is  the  time  needed  for  a  signal  to  propagate  through  the  dev¬ 
ice  once  it  has  been  set. 

The  first  unit  is  a  map  ,  i.e.  a  fixed  permutation  that  needs  no 
set  time.  The  time  needed  to  propagate  through  the  map  is  denoted  as 
t 

P- 

The  next  unit  is  a  permutation  primative .  It  either  permutes  the 
incoming  signal  or  bypasses  it.  It  can  be  viewed  as  a  switching  mechan¬ 
ism  that  directs  the  signal  to  one  of  two  fixed  maps.  The  switching 
mechanism  has  a  set  time  associated  with  it,  which  is  denoted  as  t 
The  signal  has  to  then  propagate  through  the  switching  mechanism  as  well 
as  through  one  of  the  two  maps.  This  time  is  denoted  as  t 

pp- 

The  rest  of  the  units  can  be  constructed  from  these  two  basic  dev¬ 
ices  and  thus  have  set  and  propagation  times  that  can  be  easily  derived. 

The  function  of  a  residue  adder  is  to  perform  modular  addition. 
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It's  structure  is  shown  in  Fig.  It  consists  of  a  cascade  of  permuta¬ 
tion  primatives  that  are  controlled  by  an  action  table  implemented  with 
conventional  electronic  logic.  The  action  table  for  a  modulus  7  adder 
is  shown  in  Fig.  5  and  Table  .  The  columns  of  the  table  represent  the 
maps  of  the  various  permutation  primatives.  The  rows  represent  the 
number  to  be  added.  If  the  number  to  be  added  is  5  then  that  row  of  the 
action  table  indicates  that  the  permutation  P ( X )  =  X  +  4  and  the  permu¬ 
tation  P(X)  =  X  +  1  must  be  activated.  Since  the  permutation  primatives 
are  cascaded  this  will  be  equivalent  to  a  permutation  of  P(X)  =  X  +  5. 
Such  an  action  table  is  a  simple  table  lookup  and  can  be  implemented 
with  one  level  of  logic.  The  set  time  would  thus  be  the  time  for  one 
level  of  logic,  t^ t  ancj  the  set  time  of  the  permutation  primatives,  t 
The  propagation  time  depends  on  how  many  permutation  primatives  are  cas¬ 
caded,  which  in  turn  depends  on  the  number  of  binary  bits  needed  to 
represent  the  modulus.  The  propagation  time  would  be  r  log,,  m^Upp  where 

is  the  modulus  and  the  half  brackets  indicate  the  next  larger 
integer . 

The  structure  of  a  residue  subtracter  is  identical  to  that  of  a 
residue  adder,  except  that  the  action  table  is  modified.  Such  an  action 
table  for  modulus  7  subtraction  is  shown  in  Table  5.  Modular  subtrac¬ 
tion  is  equivalent  to  addition  of  a  modular  complement.  To  subtract  a 
value  of  5  ,  the  permutation  P ( X )  =  X  +  2  must  be  activated,  since  2  is 
the  modular  complement  of  5  for  modulus  7.  The  set  and  propagate  times 
are  identical  to  those  of  a  residue  adder. 

The  structure  of  a  residue  multiplier  is  shown  in  Fig.  6.  It  con¬ 
sists  of  two  fixed  maps,  several  permutation  primatives,  and  an  action 
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FIG.  8:  EXAMPLE  OF  MODULUS  5  MULTIPLICATION 


table.  The  fixed  maps  perform  the  equivalents  of  modular  logs  and  anti¬ 
logs  [  Ref.  1,  p.  36  ]  *.  The  right-most  permutation  primative  performs 
the  permutation  P(X)  =  0  which  covers  the  case  of  multiplication  by 
zero.  The  other  permutation  primatives  provide  various  cyclic  shifts. 
The  permutations  needed  for  a  modulus  5  multiplier  are  shown  in  Fig.  7. 
The  action  table  is  shown  in  Table  6.  To  multiply  by  3  the  action  table 
directs  permutation  primatives  PP1  and  PP^  in  Fig.  8  to  be  activated. 
If  a  signal  propagates  through  map  Ml,  permutation  primatives  PP1  and 
PP2,  and  map  M2,  then  the  permutation  will  be  equivalent  to  the  desired 
permutation  of  P ( X )  =  3X.  The  set  time  of  such  a  unit  would  require  a 
time  of  t^  for  the  action  table  and  a  time  of  ts  to  set  the  required 
permutation  primatives.  The  propagation  time  varies  with  the  size  of  the 
moduli  since  this  influences  the  number  of  permutation  primatives  needed 
to  provide  the  required  shifts.  The  propagation  time  would  be 
^p  +  riog2  (mj[-1)ltpp  +  tpp* 

The  structure  of  a  binary  to  residue  encoder  is  shown  in  Fig.  9. 
It  consists  of  a  cascade  of  permutation  primatives.  If  the  number  to  be 
encoded  is  in  the  form  an2n  +  an_t2n_1  +  ...  +  ^2  +  a0,  then  the  permu¬ 
tation  of  the  permuation  primative  PPn  iS  p(x)  =  X  +  2n-  The  various 
bits  of  the  number  to  be  encoded  are  used  to  activate  the  associated 
permutation  primative.  Since  all  the  permutation  primatives  are  set  in 
parallel,  the  set  time  is  just  tgi  The  propagation  time  depends  on  how 
many  permutators  are  cascaded  which  in  turn  depends  on  the  number  of 
bits  in  the  number  to  be  encoded.  The  propagation  time  is  thus 
r 1°82  Nmax^tppi  where  Nmax  is  the  largest  number  to  be  encoded. 

•  Also  see  pages  118-121  of  Residue  Arithmetic  iml  Ilfl  Applica¬ 
tions  to  Computer  Technology  by  N.  S.  Szabo  and  R.  I.  Tanaka, 
McGraw-Hill  1967 
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FIG.  9:  BINARY  TO  RESIDUE  ENCODER 
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TABLE  7;  MODIFIED  ADDER 

ACTION  TABLE  FOR 
ENCODING 
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Encoding  can  also  be  accomplished  by  modifying  the  action  table  of 
a  residue  adder.  A  modified  action  table  to  convert  a  *4  bit  binary 
number  into  its  modulus  7  equivalent  is  shown  in  Table  7.  As  an  exam¬ 
ple,  to  encode  1010,  which  is  10  in  binary,  the  permutations  P ( X )  =  X  + 
2  and  P(X)  =  X  +  1  have  to  be  activated.  This  produces  a  permutation  of 
P(X)  =  X  +  3.  This  is  correct  since  1010  has  a  net  contribution  of  3  for 
modulus  7. 

By  properly  incorporat ing  the  effects  of  the  weighting  factors 
associated  with  the  various  digits  of  a  number,  an  action  table  can  be 
implemented  to  encode  any  n  bits  of  a  number.  Several  of  these  units 
can  be  cascaded  to  encode  all  the  portions  of  a  number.  The  set  time  is 
the  same  as  that  of  a  residue  adder.  The  propagation  time  would  depend 
on  how  many  such  units  were  cascaded. 

It  has  been  assummed  in  the  previous  discussion  of  building  blocks 
that  the  control  signals  for  the  building  blocks  were  electrical.  In 
some  of  the  subsequent  discussion  the  control  signal  will  be  optical.  An 

additional  detection  time  of  t^  seconds  would  have  to  be  added  to  the 
set  times  of  the  devices  to  represent  the  time  needed  to  convert  from  an 
optical  to  an  electrical  signal. 

These  building  blocks  can  be  assembled  to  form  modular  processors 
capable  of  solving  certain  types  of  problems. 

( 1 )  Summation  Processor 

The  structure  of  a  modular  processor  that  performs  summation  is 
shown  in  Fig.  10.  The  coefficients  of  the  desired  summation  are  tapped 
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off  the  bus  and  used  to  set  the  encoders.  The  subtractors  and  permuta¬ 


tion  primatives  form  the  mixed  radix  decoding  section  *.  They  use  the 
previously  determined  mixed  radix  coefficients  to  convert  the  present 
result  into  another  mixed  radix  coefficient.  All  the  building  blocks  can 
be  set  at  the  same  time.  The  pipelined  structure  shown  in  Fig.  2  insures 
that  all  the  coefficients  and  previously  derived  mixed  radix  coeffi¬ 
cients  are  available  to  each  modular  processor.  The  maximum  processor 
time  for  a  modular  summation  processor  would  be 

^max  =  (t's  +  t(j)  +  log2  Nmax^pp  +  -1)(Tlog2  mi"ltpp  +  tpp)  +  t<j 

=  (  set  encoders,  subtractors,  and  permutators)  + 

(  propagate  through  encoders,  subtractors,  and  permutators  )  + 

(  detect  ), 

where  Nd  is  the  nun,per  of  points  to  be  summed  and  Nm  is  the  number  of 
moduli  used.  The  throughput  rate  of  such  a  pipelined  system  would  be  the 

recriprocal  of  tmax<  This  equation  is  significant  in  that  it  connects  tne 
characteristics  of  a  technology  with  the  performance  of  a  system 
designed  to  perform  a  particular  type  of  computation.  This  connection 
will  aid  in  evaluating  the  appropriateness  of  a  particular  technology 
for  a  particular  type  of  problem. 


(2)  Innex:  Product  Processor 


The  structure  for  a  modular  processor  that  performs  inner  products 
is  shown  in  Fig.  11.  The  coefficients  of  the  desired  inner  product  are 


•  The  permutation  primatives  can  be  replaced  with  fixed  maps  if 
the  error  correcting  feature  described  below  is  not  desired.  To 
correct  an  error,  the  suspicious  result  from  a  previous  modular 
processor  is  excluded  from  the  decoding  process  by  instructing  the 
associated  subtractor  to  subtract  a  0  and  the  associated  premuta¬ 
tion  primative  to  bypass,  rather  than  multiply,  the  result  by  a 
constant.  This  operation  would  perserve  the  correctness  of  a  par¬ 
ticular  calculation  at  the  expense  of  a  reduced  range. 
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tapped  off  the  data  bus  and  used  to  set  all  the  encoders.  The  previously 
derived  mixed  radix  coefficients  on  the  bus  are  used  to  set  the  subtrac¬ 
tors  and  permutation  primatives  of  the  decoding  section.  Signals  pro¬ 
pagate  through  all  the  encoders.  The  signals  of  one  vector  are  then  used 
to  set  the  multipliers.  The  signals  associated  with  the  other  vector 
then  propagate  through  these  multipliers.  These  signals  then  set  all  the 
-  adders.  A  signal  then  propagates  through  all  the  adders  and  the  decoding 
section.  The  maximum  processor  time  for  a  modular  processor  that  per¬ 
forms  inner  products  would  be 

^max  ^s  +  ^  logp  ^max^pp  +  ^s  +  +  (2tp  +  r  log2  m^ltpp  +  tpp) 

+  ^s  +  ^i)  +  Ncjflog2  mi^  tpp  +  ( Nm- 1 )  ( T  log2  mx^pp  +  ^pp)  +  fcd 
=  (set  encoders)  +  (propagate  through  encoders)  + 

(set  multipliers)  +  (propagate  through  multipliers)  + 

(set  adders  and  subtracters)  + 

(propagate  through  adders  and  subtractors)  +  (detect). 

If  it  is  assumraed  that  t^  is  rauCh  smaller  than  tg  or  tj,  then  the 
length  of  the  inner  product  is  not  a  significant  factor.  The  processor 
time  would  then  be  dominated  by  the  three  set  times. 

(3)  Determinant  Processor 

The  evaluation  of  determinants  are  useful  in  the  computation  of 
matrix  inverses.  They  are  the  sums  and  differences  of  multiple  products. 
The  structure  of  a  modular  processor  to  evaluate  determinates  is  shvwn 
in  Fig.  12.  The  coefficients  of  the  desired  determinant  are  tapped  off 
the  data  bus  and  used  to  set  all  the  encoders.  The  previously  derived 
mixed  radix  coefficients  are  used  to  set  the  decoding  section.  Signals 
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12:  DETERMINATE  PROCESSOR 


13:  SQUARED  DISTANCE  PROCESSOR 


(  <  1 

propagate  through  all  the  encoders  in  parallel.  These  signals  are  used 
to  set  the  multipliers.  Signals  propagate  through  all  the  multipliers. 
These  signals  are  used  to  set  all  the  adders.  A  signal  then  propagates 
through  all  the  adders  and  the  decoding  section.  The  maximum  processor 
time  for  a  modular  processor  that  evaluates  determinants  would  be 

tmax  =  fcs  ♦  r log2  Nmaxltpp  +  (ts  +  ti)  +N<j(2tp  +  r log2  miltpp  +  tpp) 

+  ^s  +  ti)  +  Ndr log2  miltpp  +  (Nm-1 )(T log2  miltpp  +  tpp)  +  td 
=  (set  encoders)  +  (propagate  through  encoders)  + 

(set  multipliers)  +  (propagate  through  multipliers)  + 

(set  adders  and  subtractors)  + 

(propagate  through  adders  and  subtractors)  +  (detect), 
where  iS  the  dimension  of  the  determinate. 

This  processor  time  is  approximately  equal  to  that  of  the  inner 
product  processor.  It  is  also  dominated  by  three  set  times.  The  only 
difference  is  that  in  this  processor,  a  signal  has  to  propagate  through 
several  multipliers  rather  than  just  one  as  in  the  case  of  inner  pro¬ 
ducts.  The  larger  the  matrix  involved  the  more  multipliers  have  to  be 
cascaded.  If  the  propagation  time,  t^^  is  assumed  to  be  much  smaller 
than  that  of  the  set  time,  tg>  it  can  pe  seen  that  the  size  of  the 
matrix  does  not  strongly  influence  the  processor  time. 

(4 )  Squared  Vjgctur  Distance  Processor 

The  sum  of  the  squares  of  the  differences  of  two  vector  components 
is  used  as  a  distance  evaluator  in  many  optimizing  or  decision  algo¬ 
rithms.  A  modular  processor  to  compute  such  squared  distance  is  shown  in 
Fig  13.  The  unique  feature  is  that  the  signals  propagate  through  a 
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fixed  map  before  they  are  added.  This  map  performs  the  polynomial 
transform  P(X)  =  X2.  This  example  is  used  to  demonstrate  that  any  poly¬ 
nomial  with  integer  coefficient  and  exponents  can  be  used  in  these  pro¬ 
cessors,  The  maximum  processor  time  for  a  modular  processor  that  com¬ 
putes  such  distances  would  be 

''max  =  t3  +  r log2  NmaxTtpp  +  (tg  +  ti)  +  flogp  miltpp  + 

+  (ts  ♦  ll)  +  Ndr  log2  miltpp  +  (Nm-1)(Uog2  miltpp  +  tpp)  +  td 
=  (set  encoders)  +  (propagate  through  encoders)  + 

(set  subtracters)  +  (propagate  through  subtracters)  + 

(propagate  through  map)  ♦  (set  adders  and  subtractors)  -*■ 
(propagate  through  adders  and  subtractors)  +  (detect). 


As  mentioned  previously,  the  expressions  for  maximum  processor  time 
derived  for  the  various  modular  processors  provide  a  means  of  evaluating 
the  effects  of  certain  technological  approaches  to  certain  types  of  com¬ 
putation  problems. 


As  an  example,  suppose  that  the  maps  were  implemented  with  wire 
interconnections  and  the  permutation  primatives  were  constructed  using 
field  effect  transistors  (  FET's  )  as  switching  devices  as  shown  in  Fig. 
1*1.  As  discussed  previously,  the  other  building  blocks  can  be  con¬ 
structed  from  these  basic  elements.  If  the  set  time,  t  is  assummed  to 
be  10  nanoseconds  and  the  propagation  time,  t  0f  each  building  block 
is  assummed  to  be  5  ns.  then  the  throughput  rate  for  inner  products  of 
vectors  with  100  elements  of  arbitrary  accuracy  would  be  approximately 
r  =  1/[  3(  10  ns  )  +  1 00 (  5  ns  )] 
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This  would  be  equivalent  to  about  2  million  inner  products  a  second. 
Since  each  inner  product  consists  of  100  multiplications  and  102  addi¬ 
tions  this  is  equivalent  to  202  x  2  x  10®  or  about  *400  million  arith¬ 
metic  operations  a  second. 

The  performance  of  an  optical  version  of  the  same  processor  can  be 
examined  in  a  similar  manner.  If  Bragg  coupler  technology  is  considered, 
then  a  set  time  for  each  building  block  of  30  ns  and  a  propagation  time 
of  200  ps  can  be  assumed.  This  would  result  in  a  throughput  of  approxi¬ 
mately 

r  =  1/[  3(  30  ns  )  +  100(  200  ps  )] 

=  9  mhz. 

This  rate  represents  9  million  inner  products  a  second,  which  is  is 
equivalent  to  202  x  9  x  10®  =  1,800  million  arithmetic  operations  a 
second. 

The  high  throughput  of  residue  processors  used  as  examples  is  not 
due  to  the  technology  but  rather  the  architecture.  The  set  times  of  10 
and  30  ns  are  quite  slow  compared  to  the  switching  times  of  present 
electronic  logic.  The  throughput  is  due  to  both  a  mathematical  and 
structural  advantage.  Mathematically,  not  having  to  deal  with  carries 
greatly  speeds  up  addition  and  multiplication.  Structurally ,  the  proces¬ 
sor  is  highly  parallel  and  very  effectively  pipelined. 
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D.  Mtfdular  Prgcesstfrs  Pipelined  Bank;?  (  Jig.  Turtle  ) 

Modular  Processors  can  also  be  pipelined  by  banks.  This  approach  is 
referred  to  as  the  "turtle"  or  as  a  parallel  number  theoretic  processor. 
Its  structure  favors  different  technological  traits.  The  rabbit 
approach  relies  on  speed  while  the  turtle  approach  relies  on  parallelism 
to  achieve  large  throughputs. 

The  overall  structure  of  such  processors  is  shown  in  Fig.  15.  The 
data  to  be  processed  is  fed  to  the  modular  encoders.  The  encoded  values 
are  then  given  to  modular  processors.  The  modular  results  are  then  fed 
to  a  residue-to-mixed-radix  converter.  The  mixed  radix  equivalent  is 
then  given  to  a  mixed-to-normal  radix  converter  that  produces  a  normal 
radix  equivalent  of  the  answer  to  the  desired  computation. 

The  processor  relies  entirely  on  table  lookup.  The  tables  can  be 
implemented  with  read  only  memories.  A  modular  5  addition  table  is  shown 
in  Table  8.  Neither  the  operands  nor  the  sum  ever  exceed  4  in  value. 
The  operands  and  sum  can  thus  each  be  expressed  with  3  bits.  This  table 
can  be  implemented  with  a  read  only  memory  by  combining  the  3  bits  of 
each  operand  to  form  an  address  and  storing  at  that  location  a  3-bit 
representation  of  the  sum.  Such  a  table  is  shown  in  Table  9.  As  an 
example,  to  add  4  and  3  these  operands  are  first  expressed  in  binary  as 
100  and  Oil.  These  are  combined  to  form  the  address  10001  1.  Stored  at 
this  location  in  the  memory  shown  in  Table  9  is  010,  which  is  the 
modulus  5  equivalent  of  the  sum  of  4  and  3. 

Similar  tables  can  be  constructed  to  perform  modular  subtraction, 


multiplication  ,  and  polynominal  transforms  for  any  modulus. 
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TABLE  9:  ROM  VERSION  OF  MODULUS  5 
ADDITION 
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( i )  Binary  io  Residue 


Tables  to  perform  encoding  from  binary  to  residue  notation  can  be 
performed  by  tables  constructed  using  three  basic  strategies. 

One  approach  is  by  direct  table  lookup  as  shown  in  Fig.  16(a).  The 
number  to  be  encoded  is  used  as  the  address  while  the  content  at  this 
location  is  the  desired  modular  equivalent.  This  approach  is  only  prac¬ 
tical  for  encoding  relatively  small  numbers  (  less  than  10  bits  ). 

A  second  method  involves  a  cascaded  approach  as  shown  in  Fig. 
16(b).  The  bottom  most  node  performs  a  direct  table  lookup  to  encode  a 
portion  of  the  number  while  the  subsequent  nodes  encode  other  portions 
and  adds  the  equivalent  of  this  new  portion  in  a  modular  manner  to  the 
result  from  the  previous  node.  This  chaining  process  can  be  extended  to 
incorportate  any  number  of  bits. 

A  final  method  involves  a  parallel  approach  as  shown  in  Fig.  16(c). 
Portions  of  a  number  are  encoded  by  direct  table  lookup.  The  equivalents 
of  these  portions  are  then  added  together  in  a  modular  manner.  This 
method  can  be  extended  to  incorporate  any  number  of  bits. 

These  three  basic  approaches  to  encoding  can  be  modified  to  include 
the  encoding  of  negative  numbers  represented  in  either  sign  magnitude  or 
two's  complement  format. 

Once  the  data  has  been  encoded  into  their  modular  equivalents  it 
can  be  used  to  perform  computations  in  a  modular  manner. 

(2)  Summation  Processor 
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fig.  16a: 
DIRECT  ENCODING 


FIG.  16c; 
PARALLEL  ENCODING 


FIG.  16b 

CASCADED  ENCODING 


The  structure  of  a  modular  processor  that  performs  summation  for  a 
particular  modulus  is  shown  in  Fig.  17.  Each  of  the  nodes  represents  a 
read-only  memory.  The  data  entering  the  bottom  of  a  node  is  used  to 
form  an  address.  The  content  of  the  node  at  that  particular  address 
emerges  at  the  top  of  the  node.  If  the  modular  equivalents  of  the  16 
numbers  to  be  summed  are  fed  to  the  bottom  row  of  nodes  then  the  modular 
equivalent  of  their  sum  will  eventually  emerge  from  the  root  node  (  top 
most  node  ) .  Such  a  processor  can  be  constructed  for  any  modulus  by  pro¬ 
gramming  the  ROMs  in  the  proper  manner. 

(3)  Inner  Product  Processor 

A  modular  processor  that  performs  inner  products  is  shown  in  Fig. 
18.  The  boxes  represent  encoders,  the  nodes  with  an  X  represent  modular 
multiplication  ROMs,  and  the  node  with  a  D  represents  a  delay  node  *. 
The  modular  equivalents  of  the  vectors  to  be  processed  are  fed  to  the 
bottom  row  of  nodes.  The  modular  eqivalent  of  the  inner  product  will 
emerge  from  the  root  node. 


(4)  Determinant  Processor 

A  modular  processor  that  evaluates  determinants  is  shown  in  Fig. 
19.  The  main  difference  from  the  inner  product  processor  is  that  more 
multiplier  nodes  are  needed.  The  modular  equivalents  of  the  matrix  coef¬ 
ficients  are  fed  to  the  bottom  row  of  nodes.  The  modular  equivalent  of 
the  determinate  will  emerge  from  the  root  node. 


*  A  delay  node  duplicates  the  bits  of  its  address  on  its  output  at 
a  later  time.  This  delay  maintains  synchronization  between  the 
various  portions  of  a  calculation  so  that  they  merge  correctly. 


35 


MODULAR  INNER  PRODUCT  PROCESSOR 
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(5)  Squared  Distance  Processor 

A  modular  Processor  that  computes  the  sum  of  the  squares  of  the 
differences  between  the  components  of  two  vectors  is  shown  in  Fig.  19.5 
The  nodes  denoted  with  a  P  perform  the  integer  polynorainal  transform 
P(X)  =  (X-Y)^  *.  The  modular  equivalents  of  the  vectors  are  fed  to  the 
bottom  row  of  nodes.  The  modular  equivalent  of  the  sum  of  the  squares  of 
the  difference  of  the  vectors  will  emerge  from  the  root  node. 

(6)  Other  Computations 

To  generalize,  a  modular  processor  can  be  designed  to  perform  any 
combination  of  additions,  subtractions,  multiplications,  or  integer 
polynominal  transforms.  The  structure  of  such  a  processor  follows 
directly  from  the  expression  evaluation  tree  of  the  desired  computation. 
The  expression  evaluation  tree  of  the  inner  product  processor  in  Fig.  18 
is  shown  in  Fig.  18.1.  The  only  difference  is  that  a  delay  node  has  been 
inserted  to  equalize  the  terminal  path  lengths  to  insure  proper  syn¬ 
chronization  of  the  different  portions  of  the  computation. 

( 7 )  Conversion  From  Residues  To  Mixed  Radix 

A  given  computation  is  done  in  a  modular  manner  by  several  dif¬ 
ferent  modular  processors.  These  modular  results  are  then  rewoven 
together  to  construct  a  mixed  radix  version  of  the  answer.  This  mixed 
radix  conversion  can  be  performed  by  various  integer  polynomial 
transforms  [Ref.  1,  p  17-25].  The  structure  of  such  a  converter  is 

*  Other  processors  can  be  designed  using  more  complex  integer  po¬ 
lynominal  transforms.  The  tables  required  are  no  more  complex  that 
those  required  for  addition. 
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FIG. 19.5:  MODULAR  VECTOR  DISTANCE  PROCESSOR 
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FIG.  20:  RESIDUE  TO  MIXED  RADIX 
CONVERTER 
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shown  in  Fig.  20.  The  nodes  on  the  bottom  row  represent  the  root  nodes 
of  three  modular  processors.  The  nodes  denoted  with  a  P  perform  the 
required  polynominal  transforms.  Delay  nodes  temporarily  store  the 
coefficients  as  they  are  produced.  What  emerges  are  the  mixed  radix 
coefficients  of  the  answer  to  the  desired  computation. 

(8)  Conversion  From  A^  Mixed  To  _A  Normal  Radix 

The  mixed  radix  version  of  the  answer  is  sufficient  for  sign  and 
relative  magnitude  determination.  In  some  situations  it  is  desirable  to 
further  convert  the  mixed  radix  into  a  normal  radix  number.  An  overview 
of  such  a  converter  is  shown  in  Fig.  21.  Each  value  of  each  mixed  radix 
coefficient  has  a  normal  radix  equivalent.  These  equivalents  are 
recalled  from  storage  and  added  to  produce  a  normal  radix  equivalent.  To 
simplify  this  addition,  a  carry  save  adder  strategy  is  employed.  This 
reduces,  without  carries,  the  equivalents  to  be  added  to  only  two.  A 
carry  propagate  adder  is  then  used  to  add  this  final  pair.  The  storage, 
carry  save  adders,  and  carry  propagate  adder  can  be  implemented  in  a  bit 
slice  format  with  read-only  memories. 

The  structure  of  the  storage  section  is  shown  in  Fig.  22.  The 
nodes  on  the  bottom  row  are  the  output  nodes  of  the  residue  to  mixed 
radix  converter  shown  in  Fig.  20.  The  mixed  radix  coefficients  are  dis¬ 
tributed  to  various  storage  nodes,  denoted  with  an  S,  that  store  multi¬ 
bit  slices  of  the  normal  radix  equivalents.  The  top  left  three  nodes 
store  bits  11  through  8,  7  through  4,  and  3  through  0  of  the  equivalent 
of  the  mixed  radix  coefficient  The  other  nodes  store  multibit  slices 
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FIG.  21:  OVERVIEW  OF  MIXED  TO  NORMAL 
RADIX  CONVERTER 
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FIG.  22:  STORAGE  OF  NORMAL  RADIX 
EQUIVALENTS 
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of  the  sum  of  the  equivalents  associated  with  the  mixed  radix  coeffi¬ 
cients  a1  and  a2>  This  pairing  and  pre-addition  technique  reduces  both 
the  storage  that  is  needed  and  the  number  of  equivalents  to  be  added. 

The  carry  save  adder  section,  which  reduces  a  3  number  sum  into  a  2 
number  sum  in  parallel  without  using  carries,  is  shown  in  Fig.  23.  The 
nodes  denoted  with  an  A  perform  conventional  binary  addition  on  multibit 
operands.  These  adders  are  implemented  with  tables.  A  table  for  2  bit 
operands  is  shown  in  Table  T 1 5  of  the  appendix.  Each  adder  node  han¬ 
dles  a  2  bit  slice  from  each  of  the  three  numbers  to  be  added.  It  pro¬ 
ducts  a  2  bit  sum  and  a  2  bit  carry.  The  sum  bits  from  all  the  adders 
are  combined  to  form  ot.e  number  while  all  the  carry  bits  form  another 
number.  Two  zero  bits  are  padded  onto  the  number  synethized  from  the 
carry  bits  to  perserve  the  proper  significance  of  these  carry  bits.  A 
sum  of  3  numbers  can  thus  be  reduced  to  a  2  number  sum.  Several  such 
carry  save  adders  can  be  used  to  reduce  a  sum  of  many  numbers  to  only  a 
sum  of  two  numbers. 

The  remaining  2  numbers  are  added  with  a  carry  propagate  adder.  A 
carry  propagate  adder  is  shown  in  Fig.  24.  The  two  numbers  are  added  in 
multibit  slices  from  least  to  most  significant  slice.  This  allows  a 
carry  to  propagate  between  the  slices.  The  nodes  denoted  with  an  A  are 
the  same  binary  adders  used  in  the  carry  save  adder.  The  nodes  denoted 
with  a  D  are  delay  nodes  to  delay  slices  of  the  operands  until  they  are 
needed.  Delay  nodes  are  also  used  to  delay  slices  of  the  sum  so  that 
they  emerge  in  synchronization. 

(9)  Synchronous  And  Asynchronous  Turtle  Processors 
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An  overview  of  a  turtle  processor  designed  to  perform  summation  is 
shown  in  Fig.  25.  The  bottom  row  of  nodes  consists  of  encoders.  The 
next  two  rows  consists  of  adders.  Each  of  three  clusters  of  nodes  at 
the  bottom  represent  different  modular  processors.  The  fourth  and  fifth 
row  form  the  residue-to-mixed  radix  converter.  The  sixth  row  contain 
the  storage  units  for  the  mixed-to-normal  radix  converter.  Since  in  this 
simple  example  there  are  only  two  normal  radix  equivalents  to  be  added 
a  carry  save  adder  section  is  not  necessary.  The  top  6  rows  of  nodes 
form  a  carry  propagate  adder. 

This  version  of  the  processor  is  designed  for  syrchronious  opera¬ 
tion.  The  ROMs  used  in  the  processor  must  either  be  input  or  output 
latchable.  The  data  to  be  processed  is  fed  to  the  bottom  row  of 
encoders.  On  a  clock  signal  these  memories  are  read  and  the  results  are 
used  as  the  inputs  for  the  next  row  or  ROMs.  The  data  for  another  com¬ 
putation  is  then  fed  to  the  encoders.  On  each  subsequent  clock  cycle  the 
data  of  a  particular  calculation  proceeds  to  s-  '°quent  rows  of  the  pro¬ 
cessor  in  a  Roman-phalanx-  like  manner.  The  results  of  each  calculation 
will  eventually  emerge  from  this  pipelined  processor.  The  throughput 
rate  would  be  the  reoriprocal  of  the  ROM  cycle  time. 

A  version  of  the  processor  can  also  be  designed  for  asynchronous 
operation.  Such  a  structure  is  shown  in  Fig.  2 6.  In  this  case  the  ROMs 
are  not  latachable.  The  data  for  a  particular  computation  is  placed  at 
the  encoders.  The  results  just  propagate  through  the  system.  There  are 
many  races  but  since  there  is  no  feedback  or  memory  in  the  processor 
none  of  the  races  are  critical.  The  answer  is  derived  in  a  Darwinian 
manner.  The  unit  can  be  viewed  as  one  large  combinatoric  circuit. 


t  ! 

Any  computation  involving  any  combination  of  additions,  subtrac¬ 
tions,  multiplications,  or  integer  polynominal  transforms  can  thus  be 
accomplished  with  a  combinatoric  circuit. 

(10)  Example  Q1  A  Turtle  Processor 

As  an  example  of  how  such  turtle  processors  would  operate,  the 
tables  representing  each  of  the  nodes  shown  in  Fig.  25  and  26  are 
included  as  tables  in  the  appendix.  An  example  of  the  sum  of  13  +  (-11) 
+  7  +  (-17)  is  shown  in  Table  10.  In  six  bit  two's  complement,  this  sum 
is  expressed  as  001  101  +  110101  +  0001  1  1  +  101  1  11.  Table  T5  is  used  to 
translates  these  values  into  their  modulus  5  equivalents  of  011,  100, 
010,  and  011.  011  and  100  are  then  combined  to  form  the  address  of 
011100  which  is  translated  by  table  T1  into  the  modular  sum  of  010.  010 
and  011  are  summed  in  the  same  manner  to  produce  000.  These  two  results 
are  then  summed  with  table  T1  to  produce  010.  Tables  T6  and  T2  perform 
this  same  procedure  for  modulus  7.  The  result  is  110.  Tables  T7  and  T3 
perform  this  same  procedure  for  modulus  8.  The  result  is  000. 

The  modulus  5,  7,  and  8  results  of  010,  110,  and  000  are  converted 
to  mixed  radix  by  tables  T8,  T9,  and  T10.  The  modulus  5  and  7  results 
are  combined  to  form  the  address  of  010110.  Table  T8  translates  this 
into  101.  The  modulus  7  and  8  results  are  translated  by  table  T9  into 
110.  These  results  are  then  combined  to  form  an  address  of  101110, 
which  is  translated  by  table  T10  into  HI.  This  produces  the  mixed  radix 
coefficient  aQ  .  o  1 0 ,  ai  =  101,  and  ap  =  111.  Table  Til  then  produces 
the  normal  equivalent  of  aQ(  which  is  0010.  Tables  T12,  T 1 3 ,  and  T 1  *4 
produce  slices  of  the  normal  radix  equivalent  associated  with  the  pair 
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TABLE  10:  EXAMPLE  OF  NUMBER 
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of  coefficients  and  32,  The  result  is  111111110110.  These  two 
normal-radix  equivalents  are  then  added  with  binary  adder  nodes,  as 
represented  by  Table  T15.  The  least  significant  slice  of  this  sum  is 
00,  10,  and  10.  The  00  is  the  result  of  a  null  carry  in  for  the  least 
significant  slice.  These  operands  form  an  address  of  001010.  Table  T15 
translates  this  into  0100.  The  first  two  bits  are  used  as  a  carry  slice 
while  the  last  two  bits  form  the  sum  slice.  The  carry  is  used  as  an 
operand  in  the  next  more  significant  slice.  The  operands  of  this  slice 
are  01,  01,  and  00.  Table  T 1 5  produces  a  result  of  0010.  00  is  used  as  a 
carry  while  10  is  used  as  the  sum.  This  process  is  continued  for  the 
other  slices.  The  result  is  111111111000,  which  is  -8  in  12  bit  two's 
complement.  This  is  the  result  of  the  desired  sum  of  13  +  (-11)  +  7  + 
(-17). 

(11)  Accuracy 

The  processor  is  constructed  entirely  from  64  by  4  bit  ROMs.  It  has 
a  range  of  —  1 4 0  to  139.  By  using  larger  ROMs  larger  moduli  can  be 
represented  and  thus  larger  ranges  can  be  achieved. 

If  1024  by  6  bit  ROMs  are  used,  then  the  moduli  32,  29,  27,  25,  23, 
19,  17,  13,  11,  and  7  can  be  used  giving  a  range  of  144,403,552,893,600 
which  is  about  2^7. 

If  4096  by  6  bit  ROMs  are  used,  then  the  moduli  64,  61,  59,  57,  53, 
51,  49,  47,  43,  41,  37,  31,  29,  25,  23,  19,  17,  13,  and  11  can  be  used. 
This  would  give  a  range  of  127,290,734,521,737,197,468,723,265,600  or 
about  296.  Larger  ROMs  can  be  used  to  achieve  even  greater  ranges  if 
desired . 
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The  concept  of  range  is  different  in  a  residue  number  system.  An 
intermediate  computation  may  exceed  this  range.  It  is  only  necessary 
for  the  final  answer  to  be  within  the  range  before  it  is  converted  into 
a  mixed  radix  number.  In  the  literature  this  property  is  called  "com¬ 
pute  through  overflow".  Thus  the  range  need  only  be  sufficient  to 
represent  the  answer.  This  can  greatly  reduce  the  range  required  for 
certain  types  of  computations. 

(12)  Throughput 

A  processor  designed  to  compute  inner  products  of  100  element  vec¬ 
tors  where  each  element  is  20  bits  long,  can  be  constructed  entirely  out 
of  1024  by  6  bit  ROMs.  If  the  cycle  time  of  each  ROM  is  300  nanoseconds 
then  3-3  million  inner  products  can  be  performed  a  second  (  1 / C 300  ns.] 
).  Since  each  inner  product  consists  of  100  multiplication  and  102  addi¬ 
tions  this  is  equivalent  to  663.3  million  arithmetic  operations  a 
second.  The  latency  of  each  inner  product  would  be  12.3  microseconds . 

A  processor  using  the  same  ROMs  can  be  designed  to  perform  inner 
products  on  1000  element  vectors,  where  each  element  is  18  bits,  at  the 
same  throughput  rate  of  3.3  million  inner  products  a  second.  In  this 
case  each  inner  product  consists  of  1000  multiplications  and  1013  addi¬ 
tions.  This  would  be  a  throughput  of  6.71  x  109  arithmetic  operations  a 
second.  The  latency  would  be  13.5  microseconds  * . 

(13)  System  £abcifiaUfln 

•  As  a  rough  reference  of  throughput  an  IBM  370  can  process  about 
4  million  instructions  per  second.  Processors  such  as  the  CRAY  -  1 
and  the  ILL.IAC  IV  can  do  up  to  100  million  instructions  per  second 
in  short  bursts. 
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These  processors  can  be  constructed  in  a  modular  manner.  A  large 
portion  of  the  design  of  a  conventional  processor  is  consummed  with  the 
layout.  The  typical  procedure  involves  partitioning  the  circuit,  laying 
out  the  boards  and  specifying  the  blackplane  interconnections.  The  tur¬ 
tle  processor  is  constructed  entirely  with  2  input  and  1  output  nodes,  3 
input  and  2  output  nodes,  and  their  interconnections.  The  nodes  and 
interconnection  cables  can  be  mass  manufactured.  The  basic  modules  are 
shown  in  Fig.  27.  The  top  two  modules  represent  different  types  of 
nodes.  On  the  bottom  row  the  1  input,  2  output  module  is  a  forked  cable 
which  duplicates  a  signal.  The  1  input,  1  output  module  is  an  amplifier. 
Standardized  cables  would  carry  the  information  signals  as  well  as  util¬ 
ities,  such  as  the  clock  signal  and  the  power  between  the  modules.  The 
construction  of  such  processors  is  reduced  to  specifying  the  pattern  of 
the  nodal  interconnections  and  the  programming  of  the  ROM  of  each  node. 
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E.  What  Has  All  This  To  Do  With  Optics? 

The  turtle  processor  performs  many  table  lookups  in  parallel  during 
each  cycle.  Optical  methods  capable  of  performing  many  table  lookups  in 
parallel  can  be  conceived.  The  fundamental  device  in  such  an  optical 
processor  would  be  an  optical  ROM.  It  is  important  for  such  a  device  to 
produce  an  output  that  can  be  used  as  an  input  of  another  such  ROM.  In 
other  words,  it  should  "produce  what  it  eats".  In  optics  the  basic 

informati  n-carrying  mechanisms  are  intensity  ,  phase,  frequency,  or 
spatial  distributions  of  these  quantities.  A  ROM  based  on  intensity 
would  have  to  have  as  inputs  various  intensities  and  produce  a  predeter¬ 
mined  output  intensity.  No  convienent  implementation  based  on  either 

intensity,  phase,  or  frequency  has  yet  been  found. 

( 1 )  Optical  ROMs 

The  most  promising  approach  for  an  optical  ROM  is  based  on  embed¬ 
ding  information  on  the  spatial  domain.  The  input  operands  of  such  a 
device  would  be  intensity  distributions  of  light,  I(x,y),  and  the  output 

of  the  ROM  would  also  be  an  intensity  distribution  of  light  that  could 

be  used  as  an  input  operand  for  another  such  ROM. 

A  block  diagram  of  an  optical  ROM  is  shown  in  Fig.  28.  Two  or  more 
images,  ^in(x,y),  would  be  ANDed  together  to  form  an  uniaue  image, 

^addrCx.y).  This  image  would  be  used  to  address  an  associative  memory  to 
produce  an  output  image,  ^ou^(x,y).  This  image  could  then  be  used  as  the 
input  for  another  such  ROM. 

(2)  Optical  Addr&ssing 
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FIG.  27:  MODULES  FOR  NUMBER  THEORETIC 
PROCESSORS 
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FTG.  28:  BLOCK  DIAGRAM  OF  AN  OPTICAL  ROM 


The  fundamental  operation  of  locating  or  addressing  anything  any¬ 
where,  whether  it  is  in  a  computer  memory  or  on  a  road  map,  is  the  "AND" 
operation.  An  address  of  Oil  in  a  computer  means  that  the  2^  bit  of  the 
address  is  0  and  the  2'  bit  is  a  1  and  the  2®  bit  is  a  1.  Unfortunately, 
optical  ANDs  cannot  be  performed  in  a  convienent  manner;  however  an  opt¬ 
ical  "OR"  can. 

If  two  or  more  binary  images,  I(x,y),  are  projected  on  a  common 
surface  and  thresholded,  then  the  result  will  be  the  union  or  "OR"  of 
the  two  images,  as  shown  in  Fig.  29.  With  the  help  of  some  negations  ( 
contrast  reversals  ) ,  the  ORs  can  be  used  to  perform  ANDs  by  means  of 
DeMorgan's  law  ,  A  AND  B  =  NOT (  NOT  A  OR  NOT  B  ) ,  as  shown  in  Fig.  29. 
The  images  to  be  ANDed  are  first  inverted  in  contrast.  These  images  are 
then  ORed  by  projecting  them  on  a  common  surface  and  threshol ding .  The 
image  on  the  surface  .  *  rr^r  inverted  m  contrast.  This  results  in  the 
AND  of  the  orginal  image". 

It  is  import  a- .  •  ■  v,  •  •  icp.'i'  images  form  a  unique  out¬ 
put  image.  *  ••  . . -a  w  •; ;  1  'on  fuse  the  memory.  This 

uniqueness  *ar  re  ,  .  •  •  •  •  e  .rpu‘  images  as  shown  in  Fig. 
30(a).  Suppose  •••  .  dark  bars  could  each 
only  be  in  one  of  ;..•••  - .  •  '  •  .r.-ersion  of  the  OP  of  any  of  the 
possible  vertical  pi  *  •  f  *  n«  possible  horizontal  patterns 
will  result  in  a  unique  ;  i"“t  r  Tr.s  pi' tern  happens  to  be  a  point 
source  that  will  provide  a  -orvienent  address  image  for  an  associative 
memory.  The  five  vertical  and  five  horizontal  patterns  are  sufficient 
to  each  represent  modulus  5  operands. 


Images  of  the  form  of  a  dark  vertical  or  horizontal  bars  are 
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FIG.  30(a):  UNIQUE  INPUT  IMAGES 


FIG.  30(b):  UNIQUE  CYCLIC  IMAGES 


FIG.  30(c):  OUTPUT  IMAGES 


suitible  as  inputs  for  any  two  input  ROMs.  This  type  of  ROM  is  suffi¬ 
cient  to  construct  encoders,  modular  processors,  and  the  residue-to- 
mixed  radix  converter.  If  it  is  desired  to  completely  convert  the 
result  back  to  a  normal  radix,  then  3  input  and  2  output  ROMs  would  be 
required.  This  necessitates  a  larger  variety  of  input  and  output 
images.  Such  images  are  shown  in  Fig.  30(b).  They  have  a  cyclic  struc¬ 
ture.  If  the  frequencies  of  the  images  are  pairwise  relatively  prime, 

then  the  OR,  and  negation  and  thresholding  of  such  images  will  also  form 
a  unique  point  source  image  suitable  for  use  as  an  address. 

To  generate  the  address  image,  the  inversion  of  the  OR  of  the  input 
images  has  to  be  accomplished.  Several  physical  mechanisms  can  be  used 
to  accomplished  this.  One  method  uses  the  Hughes  Liquid  Crystal  Light 
Valve . 

One  surface  of  the  light  valve  is  light  sensitive.  A  distribution 
of  light  on  this  surface  will  modulate  the  electric  field  across  the 
liquid  crystal.  This  electric  field  in  turn  modulates  the  birefringence 
of  the  liquid  crystal.  This  change  in  birefringence  rotates  the  axis  of 
polarization  of  a  polarized  read  light  beam  impinging  on  the  other  sur¬ 
face.  The  reflected  light  is  analyzed  with  a  polarizer.  Depending  on  the 
orientation  of  this  analyzer  the  resulting  image  will  either  have  normal 
contrast  or  reversed  contrast  when  compared  to  the  image  on  the  input 
surface.  The  light  valve  would  be  used  in  a  binary  mode  to  achieve 
thresholding.  It  is  assummed  that  the  bright  portions  of  each  input 
image  would  be  sufficient  to  saturate  the  photoconductor . 

(3)  Optical  Storage 
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The  associative  memory  of  an  optical  ROM  must  associate  an  address 


image  with  an  output  image.  Such  a  memory  can  be  holographic.  The  com¬ 
plexity  of  the  memory  is  considerably  simplified  by  using  address  images 
that  are  unique  point  sources.  The  output  images  to  be  associated  with 
a  point  source  address  image  are  of  the  form  shown  in  Fig.  30(c).  If 
only  mixed  radix  results  are  desired,  then  images  of  vertical  or  hor¬ 
izontal  dark  bars  would  be  sufficient.  If  conversion  to  a  normal  radix 
is  desired,  then  images  with  a  cyclic  or  point-source  pattern  would  be 
necessary. 

The  operation  of  such  a  holographic  ROM  is  shown  in  Fig.  31.  Two 
or  more  input  images  would  be  projected  on  to  the  surface  of  a  Hughes 
Liquid  Crystral  Light  Valve.  This  image  would  then  be  inverted  in  con¬ 
trast  to  form  a  point  source  address  image.  This  point  source  would 
address  the  hologram  to  produce  an  output  image. 

(4)  System  InteKration 

Many  of  these  ROMs  can  be  placed  side  by  side  on  a  common  surface. 
Each  portion  of  the  surface  denoting  a  ROM  would  OR  its  own  input 
images.  All  the  ORs  of  all  the  ROMs  would  be  inverted  at  the  same  time. 
Each  ROM  will  then  have  an  appropriate  address  image  for  its  correspond¬ 
ing  hologram.  Rather  than  having  a  mosaic  of  holograms,  the  holograms 
for  all  the  ROMs  can  be  amalgamated  into  one  large  hologram.  This  can 
be  done  since  the  address  images  of  each  ROM  remains  spatially  unique 
even  though  the  ROMs  are  placed  side  by  side.  Each  point  source  address 
image  will  produce  an  output  image.  Spatial  offsets  can  be  incorporated 
into  these  output  images.  The  output  images  can  be  positioned  to  be  the 
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input  images  of  other  optical  ROMs.  The  holograms  would  thus  not  only 
store  the  information  but  also  distribute  it. 

In  order  to  construct  an  optical  turtle  processor,  the  output  of 
the  ROMs  must  be  used  as  inputs  of  other  ROMs.  What  is  desired  concep¬ 
tually  is  shown  in  Fig.  32.  The  output  of  the  ROMs  would  be  delayed  a 
sufficient  amount  of  time  to  allow  the  inversion  mechanism  to  recycle. 
These  delayed  output  images  would  then  be  used  as  input  images.  Unfor¬ 
tunately  the  slow  recycle  time  of  the  inversion  mechanism  makes  this 
approach  impractial. 

An  approach  using  a  twin  set  of  ROMs  is  shown  in  Fig.  33.  One 
group  of  ROMs  is  read  and  used  to  provide  the  input  to  the  other  group, 
and  vice  versa.  The  light  passing  through  the  hologram  on  the  top  left 
produces  input  images  for  all  the  ROMs  on  the  light-sensitive  side  of 
the  light  valve.  A  polarized  read  light  reflected  from  the  light  valve 
and  analyzed  by  another  polarizer  produces  a  contrast  inverted  image  of 
the  entire  surface.  Theke  point  sources,  reflected  by  a  mirror,  are  used 
to  address  the  hologram  on  the  bottom  row.  This  produces  the  input 
images  for  all  the  ROMs  on  the  bottom  light  valve.  Another  read  light  is 
used  to  generate  the  inverse  of  this  image,  which  produces  the  address 
images  for  the  hologram  on  the  top  row. 

The  two  ROMs  with  their  light  valves  are  organized  in  a  two  cycle 
approach.  One  light  valve  is  used  to  write  the  other,  and  then  vice 
versa.  The  delay  or  latency  of  the  light  valve  is  used  as  temporary 
storage.  This  keeps  the  system  going,  much  like  how  a  flywheel  is  used 
in  a  two  cycle  engine.  (  This  configuration  is  referred  to  as  a  "torus 
turtle".  )  A  three  cycle  or  Wankel  approach  using  3  ROM  banks  c.^n  be 
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FIG.  32:  OPTICAL  TURTLE  USING  DELAY 


Hughes  Read  Light 


LCLV 


FIG.  33:  OPTICAL  TURTLE  USING  TWIN  ROMS 


considered  if  the  latency  is  not  sufficient .  This  approach  can  be 


extended  to  n  cycles,  as  might  be  necessary  to  accommodate  the  long 
recovery  time  of  the  inverting  mechanism  in  some  technologies . 

The  optical  processor  as  shown  in  Fig.  33  can  be  simplified  by 
using  reflective  holograms  to  replace  the  mirrors  as  shown  in  Fig.  3^ • 
These  reflective  holograms  can  be  constructed  to  be  oriented  at  an  arbi¬ 
trary  angle.  This  would  lead  to  a  further  simplification  as  shown  in 
Fig.  35.  This  sandwich  structure  would  be  simpler,  more  stable,  and 
more  compact. 

In  all  the  optical  turtle  processor  configurations  the  input  data 
is  represented  by  an  image  that  is  projected  on  a  portion  of  the  OFing 
surface  of  a  light  valve.  The  output  is  represented  by  the  image  that  is 
reflected  from  a  portion  of  the  ANDing  surface  of  the  light  valve.  LED's 
and  optical  masks  can  be  used  to  translate  conventional  input  data  into 
images  of  the  form  shown  in  Fig.  30(a).  The  output  images  of  the  form 
shown  in  Fig.  30(c)  can  be  translated  by  photodetectors  into  conven¬ 
tional  electronic  output. 

The  question  remains  as  to  how  many  ROMs  are  possible.  This  depends 
on  the  storage  capacity  of  the  hologram.  The  situation  is  similar  to 
that  of  holographic  optical  memories.  The  required  spatial  bandwidth  of 
each  ROM  is  minimized  by  the  use  of  point  source  addresses.  For  a  ROM 
representing  a  modulus  of  32  a  field  of  32  by  32  possible  point  source 
address  would  be  needed.  Each  of  the  point  sources  must  be  asociated 
with  an  image  of  32  by  32  elements. 

(5)  Whv  Optics? 
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FIG.  34:  OPTICAL  TURTLE  USING  REFLECTIVE 
HOLOGRAMS 
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FIG.  35:  SANDWICH  VERSION  OF  OPTICAL 
TURTLE 
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The  optical  version  has  some  unique  advantages.  The  functions  of 
addressing,  storage,  and  distribution  of  information  are  all  done  in 
bulk.  Making  100  optical  ROMs  should  not  be  much  more  complex  than  mak¬ 
ing  10  ROMs.  This  approach  could  potientially  benefit  from  economies  of 
scale.  Another  advantage  is  that  such  a  processor  might  be  easier  to 
mass  manufacture,  since  reproduction  of  the  holograms  would  be  a  photo¬ 
graphic  process  rather  than  one  of  assembly,  as  electronic  versions 
would  be.  A  final  point  is  that  the  use  of  optics  considerably  simpli¬ 
fies  the  communication  problem.  Optical  signals  can  pass  through  each 
other  without  interference.  They  do  not  have  to  be  shepherded  around 
with  wires  as  in  the  electronic  version.  Conventional  wiring  contributes 
an  appreciable  amount  to  the  complexity  and  volume  of  current  proces¬ 
sors.  A  complete  optical  processor  offers  the  hope  of  a  more  compact  and 
easier-to-fabricate  processor. 

The  optical  version  does  have  a  serious  disadvantage.  The 
throughput  rate  of  such  a  unit  is  basically  the  recriprocal  of  the  cycle 
time  of  the  contrast  inverting  mechanism.  Currently  the  most  available 
unit  is  the  Hughes  Liquid  Crystral  Light  Valve  (LCLV)  which  has  a  cycle 
time  of  about  a  millisecond.  To  make  such  a  processor  viable  in  terms 
of  throughput,  many  points  have  to  be  processed  in  parallel  to  compen¬ 
sate  for  the  slow  cycle  time. 

The  relative  merits  of  an  optical  and  electronic  approach  are  dep¬ 
icted  in  Fig.  36.  The  increase  of  complexity  of  an  electronic  version 
with  increased  throughput  can  be  predicted  quite  accurately.  Admitt- 
tedly,  an  optical  approach  has  a  high  initial  overhead,  but  its  increase 
in  complexity  with  increased  throughput  is  unknown.  The  throughput  at 
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FIG. 
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36:  COMPLEXITY  VS.  THROUGHPUT  FOR  ELECTRONIC 
AND  OPTICAL  IMPLEMENTATIONS 
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which  optics  overhead  would  be  completely  amortized  is  another  open 


question.  Electronic  versions  of  the  turtle  processor  will  become  more 
and  more  significant  because  of  their  outstanding  throughput,  cost 
effectiveness,  and  modularity.  Larger  and  larger  systems  will  be  built. 
The  possibility  of  an  optical  approach  will  continually  haunt  this 
growth . 
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III.  RESEARCH  ON  RESIDUE  ERROR  DETECTION  AND  CORRECTION  METHODS 


A.  Overview 


The  optics  community  is  interested  in  research  on  the  design  of 
optical  systems  capable  of  data  computation.  One  approach  of  present 
interest  is  based  on  the  nature  of  arithmetic  operations  (addition  and 
multiplication)  in  residue  number  systems  [Ref.  1].  Residue  number  sys¬ 
tems  have  desireable  properties.  Most  significantly  of  these,  perhaps, 
is  that  residue  additions  do  not  involve  carry  operations.  The  hope  is 
that  the  natural  parallelism  of  optical  systems  can  exploit  the  fact 
that  no  carries  are  necessary. 

Unfortunately,  residue  number  systems  also  have  very  undesireable 
properties.  The  worst  of  these  is:  a  small  error  in  an  analog  process 
used  to  perform  a  residue  addition  may  result  in  large  error  in  the 
result  of  the  residue  addition.  The  intrinsic  nature  of  this  error  pro¬ 
cess  is  discussed  in  the  (3)  On  Noisy  Residue  Operations  (Section  B) . 

To  date  most  research  has  focused  on  the  design  of  physical  sys¬ 
tems  that  perform  basic  "residue  operation"  such  as  the  mod  operation  or 
addition  mod  some  moduli.  There  has  been  little  published  material  on 
the  development  of  a  general  mathematical  system  based  on  residue  number 
systems  capable  of  incorporating  error  detecting/correcting  operations. 
The  purpose  of  this  paper  is  to  make  the  following  three  contributions 
towards  the  development  of  such  a  general  mathematical  system. 

First,  a  general  mathematical  system  capable  of  characterizing  (i) 
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any  mathematical  operation  performed  in  a  residue  number  system  and  (ii) 


all  error  detection/correction  methods  is  constructed.  This  construc¬ 
tion  is  undertaken  in  Sections  B,  C,  and  D  and  completed  in  Section  E. 
The  main  result  is  found  in  the  subsection  (3)  The  Error  Checking  System 
(Section  D). 

Secondly,  and  importantly,  a  methodology  for  research  into  the 
intrinsic  capabilities  of  all  possible  error  detecting/correcting 
methods  is  proposed.  This  is  done  for  the  most  part  in  Section  E. 

Finally,  in  Sections  D,  F,  G  are  found  examples  of  error 
detecting/correcting  methods.  The  special  example  indicated  in  the 
abstract  is  in  "Example:  Special  case  of  redundant  encoding"  (Section 
F). 


In  addition  to  the  three  contributions  above,  this  chapter 
discusses  in  a  heuristic  way  concepts  of  system  complexity  and  cost. 
This  discussion  begins  in  the  subsection  (4)  Some  Heuristics  About  An 
Error  Checking  System  (Section  B)  with  a  few  brief  comments.*  The  topic 
of  cost  and  complexity  continues  in  the  examples  of  error-checking 
methods  given  in  Sections  D,  F,  and  G.  The  importance  of  defining  phy¬ 
sically  meaningful  complexity  measures  is  discussed  in  the  subsection 
(5)  Statement  Of  Research  Methodology  (Section  E)  and  in  the  Summary 
(Section  H). 


The  major  topics  developed  in  this  paper  are  briefly  summarized  in 
Section  H. 

▼  _"Fr  ror-checking"  is  a  phrase  used  through  out  this  paper  to  in¬ 
dicate  a  particular  kind  of  system.  Error-checking  systems  are 
those  systems  capable  of  any  type  of  error  detecting  and/or 
correcting,  i.e.,  capable  of  any  type  of  "check"  for  errors. 
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B.  A  FIRST  STEP  TOWARDS  AN  ERROR  CHECKING  SYSTEM 


( 1 )  The  Basic 


We  are  interested  in  performing  a  given  mathematical  mapping 
(operation).  In  this  chapter  the  mapping  to  be  performed  will  always  be 


denoted  by: 


T  A  the  given  mapping. 


The  domain  and  range  of  T  is  always  a  finite  set  of  nonnegative  integers 


Domain  T  =  Range  T  =  3C  =  (0,1,2 . M-1 ) 

where  M-1  is  the  maximum  element  of  3C.  In  the  remainder  of  this  section 
(and  paper)  we  will  take  the  viewpoint  that  the  mapping  T  accurately 
describes  the  input/output  relationship  of  some  physics)  system  designed 
to  realize  the  mapping  T.  The  symbol  T  will  serve  double  duty:  (i)  it 
will  stand  for  the  mathematical  mapping  in  a  rigorous  sense  and  (ii)  it 
will  represent  the  physical  system  designed  to  perform  the  mapping  T. 
Which  usage  is  intended  will  be  clear  from  context. 


(2)  The  Decomposition  of  T 


In  order  to  take  advantage  of  the  desireable  properties  of  residue 
number  systems,  the  operation  T  maybe  decomposed  into  the  3-stage  pro¬ 
cess  : 


~-n>.  r(x)  — t(x)  — S-e. w  T(x). 


(2.1) 


The  brief  discussion  of  this  process  that  follows  is  not  intended  to  be 
mathematically  rigorous.  Its  purpose  is  only  to  give  a  quick  overview 
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of  the  process.  Definitions  and  descriptions  that  need  to  be 
strengthened  and  made  rigorous  are  deferred  to  the  remaining  sections. 


First,  the  operation  En  "encodes"  the  integer  input  x  into  a  nxl 
"residue  vector"  r(x).  This  residue  vector  r(x)  is  just  an  ordered  col¬ 
lection  of  elements  whose  form  is  x  modnr  for  i=1 ,2, . . . ,n .* 


Definition  for  _r(_x) 

The  residue  vector  r(x)  is  the  integer  input  x  encoded  by  the 
operation  En: 


x 


r(  x) 


x  modm1 
x  modm^ 


(2.2) 


1  x  mod  1 
1  n  1 

I  11  I 

where  the  set  of  moduli  JTl  =  {trr  such  that  i=1 ,2, . . . ,n)  are  pair¬ 
wise  relatively  prime  and  the  range  M  of  the  set  7T1  is 


M  =  IT - 

i=1  1 

After  the  input  x  has  been  encoded  to  the  residue  vector  r(x),  the 
operation  OpT  performs  an  operation  on  the  residue  vector  r(x) 
equivalent  to  T's  operation  on  x.  The  operation  OpT  is  equivalent  in 
the  sense  that  T(x)  can  be  determinded  from  the  intermediate  output 
t(x)=OpT(r( x) ) . 


"*  A  "residue  vector"  is  any  vector  of  the  form  used  to  define  the 
residue  vector  r(x).  Note  that  because  the  elements  of  the  moduli 
setTTt  are  all  relatively  prime  by  definition,  there  is  no  "redun¬ 
dant"  information  (with  respect  to  mod  operations)  about  the 
scalar  input  x  in  the  residue  vector  r(x). 


72  - 


Definition  for  t(x) 


The  operation  OpT  maps  the  residue  vector  r(x)  to  the  residue 
vector  t(x): 


r(x) 


Qp.I» 


t(  x) 


'T(x)  modm1 
i 

;  T ( x )  modm^ 


T(x)  modm 

n 


(2.3) 


This  mapping  represents  the  mathematical  operation  to  be  performed, 
i.e.,  addition,  subtraction,  etc. 


In  the  third  and  final  stage  the  operation  De  "decodes"  the  residue 
vector  t ( x )  to  the  value  T(x). 


It  is  significant  to  note  that  any  mapping  defined  on  an  integer 
set  5=  {0 , 1 , 2, . . . ,M-1 }  to  the  same  set  can  be  realized  by  this  decom¬ 
position.  In  essence,  then,  the  first  purpose  of  this  paper  is  half 
completed : -  the  3-stage  process  described  in  (2.1)  can  characterize  any 
mathematical  mapping  (operation)  performed  in  a  residue  number  system 
with  range  M. 

But  what  about  the  errors?  Why,  how,  and  where  do  they  arise? 
What  is  there  nature?  The  "why  and  how"  is  a  topic  not  treated  in  this 
chapter.  The  "where"  and  what  to  do  about  it  is  the  primary  topic  of 
interest . 


(3)  On  Noisy  Residue  Operations 

First,  consider  a  "noisy"  encoding  process  En : 
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x  _  gn  r(x)  +  e(x)  =  r' (x)  (2.4) 

where  e(x)  is  a  nxl  random  vector.  More  will  said  about  the  nature  of 
e(x)  in  the  following  sections.  Suffice  it  to  say  now  that  the  form  of 
the  noisy  residue  vector  r'(x)  remains  suitable  for  input  to  the  OpT 
operation,  i.e.,  r'(x)  is  itself  a  residue  vector.  In  addition  to  a 
noisy  encoding  process  En ,  the  OpT  and  De  operations  might  be  noisy. 
More  will  be  said  about  these  cases  in  the  following  sections. 

With  the  introduction  of  possible  error  in  the  encoding  process  we 
come  to  a  key  issue:  How  will  a  nonzero  error  vector  e(x)  affect  the 
output  of  the  3-stage  process  described  in  (2.1)?  Without  going  into 
great  detail  at  this  point,  it  is  possible  to  demonstrate  that  in  gen¬ 
eral  the  error  in  the  output  is  "not  well  behaved."  As  an  illustration 
of  what  "not  well  behaved"  means  in  a  heuristic  sense  consider  the  fol¬ 
lowing  example. 

Example. 

The  process  below  encodes  with  a  noisy  encoder  En  and  then  decodes 
with  a  perfect  decoder  De: 

x  En..>.  r(x)  +  e(x)  _ x  +De(e(x)).  (2.5) 

The  decoding  operation  can  be  realized  as 

n 

y  =  [  5  «—  *b.*(y  modm.+error  in  it'n  residue)]  modM  (2.6) 

i=1  i  1  1 

where  t'ne  b^,  i=1  to  n,  are  a  set  of  integers  and  all  nr  are  contained 
in  the  set  of  moduli  used  to  encode  x. 

Suppose  the  input  x  =  y  =  0  in  (2.5),  then  using  the  decoding  for¬ 


mula  in  (2.6)  the  t'ne  decoded  noisy  output  is  given  by 


De(r* (x) ) 


x  +  De(e(x))  =  0  +De(e(x)) 


r  r 

-  L  2  — *b.* (O+error  in  the  ith  residue)]  modM 
.  ,m.  1  — 

i=1  l 

n  M 

=  [  2  — *b..(  error  in  the  ith  residue  )]  modM.  (2.7) 

.  .m,  l 
i=1  l 

The  result  displayed  in  (2.7)  demonstrates  clearly  that  if  just  one  of 

the  residue  elements  x  modrrK  is  in  error  by  only  1  the  decoded  number  is 

M 

off  by  a  multiple  of  the  factor  ( —  modm.)  modM!  Hence  a  small  error  in 
1  K  m.  l 

l 

the  encoding  process  (the  residue  x  modrru  is  off  by  1)  can  lead  to  large 
error  in  the  output  (the  result  is  off  by  some  multiple  of  the  factor 
(—  modirr)  modM)  .  In  principle,  errors  in  the  operation  OpT  will  exhi¬ 
bit  the  same  type  of  behavior  as  that  discussed  above. 

In  what  ways  may  errors  like  those  described  in  the  above  example 
be  detected?  In  what  ways  may  such  errors  be  corrected?  Sections  C,  D, 
and  E,  develop  a  mathematical  system  whose  structure  will  allow  meaning¬ 
ful  and  definite  answers  to  these  questions. 

(*• )  Some  Heuristics  About  An  Error  Checking  System 

It  is  interesting  to  note  that  even  before  one  undertakes  the 
developement  of  any  particular  system  one  can  deduce  some  important 
facts  about  the  general  nature  of  such  a  system.  First,  we  expect  the 
"cost"  of  any  system  to  be  proportional  to  (i)  the  probability  of  error, 
(ii)  the  nature  of  the  possible  error  and  (iii)  the  overall  complexity 
of  the  physical  system.  Secondly,  any  physically  meaningful  cost  cri¬ 
terion  will  certainly  be  based  on  some  measure  of  system  complexity. 
This  measure  of  complexity  will,  in  general,  take  into  account  difficul¬ 
ties  in  (i)  realizing  given  types  of  mathematical  operations  in  a 
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physical  system  and  (ii)  interfacing  the  necessary  operations.  Finally, 
at  this  point,  it  seems  reasonable  to  expect  there  will  be  many  dif¬ 
ferent  conceptual  approaches  to  realize  error-checking.  These  different 
conceptual  approaches  will  demand  at  least  some  differences  in  the  phy¬ 
sical  systems  designed  to  realize  these  approaches.  Therefore,  what  we 
mean  by  system  complexity  will  probably  have  a  significant  impact  on  the 
"costs"  of  different  approaches. 
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C.  Second  Step  Towards  An  Error  Checking  System 

( 1 )  The  Basic  Process 

Shown  in  fig.  3.1  is  a  block  diagram  representation  for  the  3-stage 
process 

x  _ En_^  r(x)  _0£l^  t(x) _ P£_».  T(x)  (2.1) 

discussed  in  Section  B.  The  scalar  input  x  is  encoded  to  a  residue  vec¬ 
tor  r(x).  The  residue  vector  r(x)  is  then  transformed  by  OpT  to  the 
residue  vector  t(x).  The  vector  t(x)  is  decoded  by  De  to  the  output 
T(x).  The  basic  process  in  fig.  3.1  has  no  explicit  "error-checking" 
capability.*  So,  if  any  of  the  operations  are  error  prone  there  is  no 
way  to  check  errors.  It  is  possible,  however,  to  develop  a  system  that 
is  derived  from  this  basic  system  and  is  capable  of  error-checking. 

(2)  Introducing  Error  Checking  Operations 

As  a  first  step  towards  developing  this  error-checking  system,  two 
operations  will  be  added  to  the  basic  process.  The  result  is  the 
error-checking  basic  process  in  fig.  3.**  Can  it  be  this  simple?  Can  we 
expect  to  accomplish  error-checking  simply  by  adding  on  the  operations 
OpFn  and  "something?( related  to  T)." 

The  answer,  clearly,  is  no.  As  given  in  the  fig.  3.2  the  new 

operations  "look"  only  at  the  noisy  residue  vector  r(x)+e(x)  and  the 

The  phrase  n  error-checking"  is  used  through  out  the  remainder 
of  the  paper  in  place  of  the  longer  phrase  "error 
detecting/correcting." 

**  The  decoding  operation  De  will  be  assumed  to  be  error  free 
through  out  the  remainder  of  the  paper. 
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noisy  residue  vector  t(x)+e'(x).*  The  way  the  encoding  process  En  was 
defined  in  (3.2)  does  not  allow  for  any  redundancy  in  the  residue  (modN 
encoding.  The  only  "information"  OpEn  has  about  x  is  the  noisy  residue 
vector  r(x)+e(x).  Similiarly,  the  only  information  the  operation 
"something? ( rel ated  to  T)"  has  about  T(x)  is  the  noisy  residue  vector 
t(x)+e'(x).  In  order  for  the  operation  OpEn  to  check  errors  in  the 
.  encoding  of  x  to  a  residue  vector,  there  must  be  some  input  to  OpEn 
other  than  just  the  noisy  residue  vector  r(x)+e(x).** 

(3)  Introducing  Pre-encoding 

This  leads  us  naturally  to  the  concept  of  "pre-encoding."  Pre¬ 
encoding  is  discussed  in  the  beginning  of  Section  D  and  again  in  Section 
E.  In  order  for  an  error-checking  operation  OpEn  to  accomplish  any 
error-checking,  the  operation  OpEn  "needs"  more  information  about  x  than 
just  the  noisy  residue  vector  r(x)+e(x).  The  purpose  of  a  pre-encoding 
operation  is  to  generate  different  and  independent  information  about  x. 
This  new  information  is  used  as  additional  input  to  some  modified 
error-checking  operation  OpEn.  The  new  information  is  assumed  to  be 
error  free. 


"*  The  vector  e 1  ( x)  is  I~  random  vector  whose  affect  on  t(x)  is 
analogous  to  the  affect  the  random  vector  eCx)  has  on  r(x)  (dis¬ 
cussed  in  Section  P) . 

**  In  principle,  the  general  description  and  structure  of  the 
error-checking  operation  OpEn  given  in  Section  E  is  sufficient  to 
define  a  suitable  error-checking  operation  OpT.  In  order  to 
economize  on  words  and  simplify  the  structure  of  the  paper,  atten¬ 
tion  is  focustJ  mainly  on  OpEn.  This  does  not  imply  a  lack  of 
generality. 
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D.  Final  Step  Towards  An  Error  Checking  System 

( 1 )  The  Basic  System 

The  basic  structure  of  the  system  illustrated  in  fig  <4 . 1  is  almost 
the  same  as  the  structure  of  the  simple  system  developed  in  Section  B 
and  shown  in  fig.  3.1.  In  comparing  the  two  systems,  one  can  see  that 
the  only  real  difference  in  the  structure  is  the  addition  of  the  opera- 
tion  PreEn  which  pre-encodes  the  input  x. 

In  place  of  the  integer  input  x  we  now  have  a  vector  x: 

x  Pre_E-V3. 

The  vector  x  should  be  thought  of  as  a  kxl  vector  which  "carries"  infor¬ 
mation  about  the  number  x.  The  dimension  k  and  the  specific  form  of  the 
individual  elements  in  x  is  not  discussed  again  until  Section  E  (in  (2) 
Towards  Defining  ).  The  kind  of  information  x  may  carry  about  x  is 
quite  general:  Whatever  one  might  dream  up  and  be  able  to  encode.  For 
example,  x  maybe  the  input  x  repeated,  some  function  of  x,  number 
theoretic  properties  like  evenness/oddness  or  primeness.  Following  the 
pre-encoding  step  the  vector  x  passes  through  the  remaining  stages  in  a 
way  analogous  to  the  integer  input  x  passing  through  the  stages  of  the 
process  shown  in  fig.  3.1. 

(2)  Description  Of  Basic  System 

First,  x  is  encoded  by  the  encoding  process  Fn : 

x  En  >  r  (  x) 

where  ?(x)  is  some  vector  that  takes  the  place  of  the  residue  vector 
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THE  BASIC  SYSTEM  USED  TO  CONSTRUCT  AN 
ERROR  CHECKING  SYSTEM  SHOWN  IN  FIG.  4.2. 


Some  convenient  output 


other  operations 
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THE  ERROR  CHECKING  SYSTEM 


r(x).  In  place  of  the  original  residue  encoding  process  Fn  is  the 
encoding  process  En  that  encodes  in  part  by  mod  operations  and  in  part 
by  a  set  operations  that  are  necessary  for  error  correction/detection. 
The  vector  r(x)  contains  all  the  information  about  x  available  to  the 
system.  It  passes  through  the  operation  OpT: 

r(x)  .  0£L>  ~(x)  . 

The  vector  t(x)  contains  all  the  information  available  to  the  system 
about  x  and  T(x).  Finally  t(x)  is  decoded  by  the  decoder  He  to  T (x): 

T(x)  Ig— . >.  T  (  x ) . 

It  should  be  noted  that  the  output  of  the  decoder  will  in  general  really 
be  some  vector  whose  elements  contain  T(x),x  and  any  other  information 
in  the  pre-encoded  x  vector.  This  then  is  the  nature  of  the  basic  sys¬ 
tem.  What  will  be  added  to  this  system  for  error  detecting/correcting? 
What  purpose  does  the  pre-encoder  PreEn  really  serve?  These  questions 
are  handled  next. 

(3)  The  Error  Checking  System 

In  fig  4.2  is  the  error-checking  system.  One  can  see  that  the 
basic  system  has  been  altered  by  eliminating  any  explicit  mention  of  a 
decoding  stage,  relabelling  the  OpT  stage  to  T,  and  adding  two  new  error 

es*  <•*«* 

checking  operations  OpFn  and  OpT.*  What  are  these  error-checking  opera- 

/V 

tions  OpEn  and  OpT?  How  do  they  relate  to  the  still  mysterious  pre¬ 
encoding  operation  PreEn? 

*  The  decoding  process  De  is  not  explictity  shown  for  the  sake  of 
simplicity.  The  T  operation  is,  in  principle,  identical  with  the 
OpT  operation  in  fig.  4.1.  The  label  is  changed  only  to  facili¬ 
tate  other  notation. 
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Discussion  of  OpEn  and  PreEn 

The  operation  OpEn  has  as  its  input  the  vector  r(x).  This  vector 
hopefully  contains  the  correct  encoding  of  the  input  vector  x.  The 
operation  OpEn  is  constructed  so  that  any  "inconsistencies"  in  the 
encoded  information  about  x  in  r(x)  can  be  detected  by  examination  of 
OpEn's  output.  Note  that  in  general,  the  pre-encoding  process  PreEn 
and  the  error-checking  process  OpEn  are  highly  related.  Any  particular 
choice  for  a  pre-encoder  certainly  will  be  influenced  by  ideas  about 
what  types  of  operations  OpEn  are  (i)  mathematically  interesting,  (ii) 
conceptually  powerful,  (iii)  physically  realizeable  and  (iv)  feasible 
under  some  cost  criterion.  Perhaps  it  will  be  instructive  to  give  two 
straight-forward  examples  of  error-checking  operations.  The  nature  of 
PreEn  and  OpEn  will  be  described  in  each  case. 

Example  one 

This  example  shows  "odd  shift"  errors  can  easily  be  detected.  What 
are  odd  shift  errors? 

Definition  f or  odd  shift  errors 

Suppose  a  scalar  input  x  is  encoded  by  a  noisy  encoding  pro¬ 
cess  to  the  residue  vector  r(x)+e(x)  and  then  decoded  to  the  scalar 
x+De(e(x)).  If  De(e(x))  is  an  odd  number  then  by  definition  r(x) 
(or  equivalently  x)  has  undergone  an  "odd  shift"  error  in  encoding. 
How  can  such  errors  be  detected?  If  the  set  of  moduli  used  to  encode  x 
are  all  odd,  such  detection  can  be  accomplished  by  generating  the  addi¬ 
tional  information  x  mod2. 


aaaii 
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As  a  special  case  consider  encoding  the  3  domain  =10,1,2 . 8}  by 

using  only  one  modulus  =  9.  The  T  domain,  x  mod9  and  x  mod2  are  dep¬ 
icted  below  in  Table  III. 1 . 


x  : 

0 

1 

2 

3 

9 

5 

6 

7 

8 

;  9 

10 

x  mod9  : 

0 

1 

2 

3 

9 

5 

6 

7 

8 

!  9 

10 

x  mo d2  : 

0 

1 

0 

1 

0 

1 

0 

1 

0 

1  1 

0 

Table  III.  1 

What  happens  if  x  mod9  undergoes  an  odd  shift  error?  Without  loss  of 
generality  suppose  x  were  0.  Then  x  should  encode  to  0  mod9  and  0  mod2. 
Suppose  however  x  mod9  encodes  to  one.  We  now  have  a  residue  vector 


-  lx  mod9  =  1  1 
'  *  lx  mod2  =  0!  • 

But  upon  examination  of  Table  III.1  we  see  that  this  residue  vector  does 
not  correspond  to  any  input  value  in  the  input  domain  T  .  Therefore  the 
information  about  x  in  r(x)  is  inconsistent.*  The  example  used  was 
x  mod9=1.  By  examining  Table  III.1  one  can  see  that  the  information  in 
the  residue  vector  would  be  inconsistent  if  x  were  encoded  to 
x  mod9=1,3,5,  or  7.  One  can  also  convince  oneself  by  using  examples  of 
one's  own  that  all  oddshift  errors  for  any  x  c ‘1  input  can  be  detected. 


The  pre-encoding  process  involved  encoding  x  mod2.  The  error¬ 
checking  operation  OpFn  decides  if  the  information  in  r(x)  (in  this  case 
P(x)  is  still  a  residue  vector)  is  consistent.  This  can  be  accomplished 
by  using  a  polynomial  transformation  with  2x9=18  degrees  of  freedom 

*  Notice  that  this  r(x)  does  correspond  to  the  integer  9  e 1 ,  That 
is,  if  we  had  encoded  the  1  domain  =  (0, 1 ,2, . . . ,  17}  with  the 
moduli  set  2  and  9  then  the  r(x)  would  not  be  inconsistent. 
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(DOF).*  The  increase  in  system  cost  for  this  type  of  error-checking 
should  be  expressible  as  something  with  the  form: 

Cost  of  error-checking  +  cost(x  mod2  operation)  +  cost(OpEn  with  18  DOF). 
System  cost  is  also  decreased,  however,  due  to  the  elimination  of  cer¬ 
tain  error  types.  The  remaining  cost  due  to  errors  has  the  form: 

Cost  after  error-checking  =  cost(even  shift  errors).** 

It  is  not  necessary  to  restrict  the  residue  encoding  process  to  1 
modulus.  Any  set  of  relatively  prime  odd  moduli  used  for  encoding  could 
be  checked  for  odd  shift  errors  using  a  perfect  x  mod2  encoding.  The 
increase  in  system  cost  would  be  of  the  form: 

Cost  of  error-checking  =  cost(  x  mod2  operation) 

+  cost(0pEn  with  2  M  DOF). 


Example  two 

This  example  demonstrates  that  all  errors  except  "multiples  of  a 
particular  type  of  redundant  modulus"  can  be  detected  easily.  What  form 
do  these  redundant  moduli  take?  Usually  when  one  says  two  moduli  m1  ancj 

m2  are  redundant  one  means  that  m^  and  m^  have  common  divisors.  The 
type  of  redundant  moduli  used  here  are  those  for  which  one  divides  the 

other,  e.g.,  m^.^  and  m2=9. 

As  a  special  case  consider  the  same  “t  domain  as  in  example  1  and 

*  There  are  2x9=18  possible  inputs  to  OpEn. 

**  I  am  assuming  the  nature  of  the  error  process  is  such  that  er¬ 
rors  are  more  or  less  uniformly  distributed  over  the  input  domain 
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encode  x  mod9  and  x  mod3.  The  domain,  x  mod9  and  x  mod3  are  depicted 
in  Table  III .2  below. 


x 

:  0 

1 

2  3  4  5  6  7  8 

!  9 

10  .  .  . 

x  mod9 

:  0 

1 

2  3  4  5  6  7  8 

!  0 

1  .  .  . 

x  mod  3 

:  0 

1 

2  0  1  2  0  1  2 

!  0 

1  .  .  . 

Table  III .2. 

Now  what  happens  when  x  mod9  is  incorrectly  encoded?  Without  loss  of 
generality  suppose  x  were  0.  Then  x  should  encode  to  0  mod9  and  0  mod 
3.  If  x  mod9  encodes  to  1,2, 4, 5, 7,  or  8  this  will  be  inconsistent  with 
the  value  for  x  mod3=0.  Therefore  error  can  be  detected  in  these  cases. 
Error  can  not  be  detected  if  x  mod9=3  or  6. 

As  an  aside,  note  that  if  the  error  process  were  such  that  the 
encoded  residue  x  mod9  could  shift  from  the  correct  value  by  only  1, 
then  all  errors  could  be  detected  and  corrected  using  the  additional 
information  about  x  contained  in  the  residue  x  mod3.  This  special  case 
is  discussed  more  fully  and  generalized  in  "Example:  Special  case  of 
redundant  encoding"  (Section  F) . 

The  pre-encoding  process  involved  encoding  x  mod3.  The  error¬ 
checking  operation  OpEn  decides  if  the  information  in  x  mod9  and  x  mod3 
is  consistent.  This  can  be  accomplished  by  using  a  polynomial  transfor¬ 
mation  with  3x9=27  DOF.  The  increase  in  system  cost  for  this  type  of 
error-checking  takes  the  same  form  as  that  descibed  in  Example  1. 

It  is  not  necessary  to  restrict  the  residue  encoding  process  to 
only  one  modulus.  Many  sets  of  relatively  prime  moduli  used  for  encod- 
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ing  could  be  checked  for  errors  using  an  error-checking  set  of  moduli 
with  a  smaller  range.  The  technique  requires  only  that  the  set  with 
the  smaller  range  be  relatively  prime  and  that  the  smaller  range,  call 
it  S,  divide  the  range  of  the  set  used  for  the  encoding.  The  range  of 
the  encoding  set  was  defined  in  Section  B  as  M.  Hence,  S  must  divide 
M.*  All  errors  in  the  %  domain  =  {0, 1 ,2, . . .M-1 }  can  be  detected  except 
those  of  the  form: 

x  +  k*S,  k=1 ,2,3 . 

The  increase  in  system  cost  should  be  of  the  form: 

Cost  of  error-checking  =  cost(encode  x  with  error-checking  moduli  set) 

+  costCOpEn  operation  with  S  M  DOF) .** 

Discussion  of  OpT 

The  error-checking  operation  OpT  is  constructed,  in  principle,  the 
same  way  the  operation  OpEn  is  constructed.  We  know  what  operation  T  on 
x  we  want  to  perform.  The  nature  of  T  and  the  design  of  the  pre-encoder 
and  OpEn  "designs  for  us"  the  necessay  T  operation.  Based  on  the  vector 
output  t(x),  the  error-checking  operation  OpT  checks  for  any  incon¬ 
sistencies  in  the  transformed  information  about  x  in  t(x). 

General  Discussion 

Note  that  in  general,  one  would  probably  not  construct  any  of  the 
operations,  PreEn.En,  T,  OpEn,  or  OpT  without  giving  careful  considera- 

'*  The  range  S  Is  trie  product  of  all  the  moduli  in  the  error¬ 
checking  set  of  moduli  used  to  check  the  encoding. 

**  It  turns  out  that  the  error  correcting  method  discussed  in  "Ex¬ 
ample:  Special  case  of  redundant  encoding"  (Section  F)  has  a  sig¬ 
nificantly  smaller  cost  than  that  suggested  by  this  formula. 
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tion  to  the  ways  all  the  operations  may  interrelate.  If  some  criterion 


for  goodness  of  physical  design  takes  into  consideration  range,  speed, 

complexity,  reliability,  etc.,  we  certainly  expect  these  operations  to 
be  highly  related. 

For  the  sake  of  completeness  a  very  brief  discussion  on  how  an 
overall  system  is  controlled  by  outputs  from  error-checking  operations 
is  given  next.  Such  a  complete  error-checking  system  is  shown  in  fig. 
3. 


(4)  Complete  Error  Checking  System 

In  the  error  correcting  system  shown  in  fig  4.2  no  provision  was 
made  either  for  the  interpretation  of  the  output  from  the  error  correct- 
ing  operations,  OpEn  and  OpT,  or  for  overall  system  control  governed  by 
such  interpretation.  For  the  sake  of  completeness  the  system  in  fig. 
4.3  makes  such  provision  by  the  addition  of  the  System  Self  Checking 
Control  Units. 

The  conceptual  framework  for  a  complete  system  has  been  developed. 
What  do  we  do  now?  There  are  three  related  steps  to  take.  First, 
develop  as  powerful  and  as  orderly  a  method  for  investigating  how  the 
operations  in  the  set  {PreEn.En,  OpEn,  OpT,  T}  are  related.  Second, 
develop  physically  meaningful  measures  of  system  complexity,  reliability 
and  feasibility.  Finally,  use  the  knowledge  and  understanding  gained  in 
the  first  two  steps  to  design  actual  physical  systems.  Section  E  under¬ 
takes  the  first  step.  Note  is  also  taken  of  the  importance  of  develop¬ 
ing  physically  meaningful  measures  of  complexity. 
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E.  RESEARCH  METHODOLOGY 


( 1 )  General  Discussion 

A  system  concept  general  enough  to  treat  most  computational  prob¬ 
lems  of  interest  as  special  cases  is  developed  in  Section  D.  A  block 
diagram  representation  of  this  system  is  shown  in  Fig.  4.2.  The  purpose 
of  this  section  is  primarily  to  develope  a  methodology  for  research  on 
error-checking  methods.  The  system  concept  defined  in  Section  D  is  only 
the  first  step  towards  this  goal.  We  still  need  to  impose  additional 
mathematical  structure  on  the  system  before  a  more  rigorous  mathematical 
analysis  may  begin.  This  mathematical  structure  will  be  imposed 
(implied)  by  the  definitions  for  the  various  elements  x,  r(x),  and  t"(x) 
as  well  as  the  operations  PreEn ,  En ,  cfpEn ,  T,  and  OpT.  There  are,  of 
course,  an  infinity  of  mathematical  structures  from  which  to  choose.* 

What  attributes  should  the  definitions  have?  Primarily  we  require 
the  definitions  to  meet  a  "let' s- make-progress"  criterion.  Secondly, 
the  generality  of  the  system  must  be  maintained.**  Tnirdly,  both  the 
definitions  themselves  and  the  structure  imposed  on  the  system  by  the 
definitions  must  be  clear  enough  so  that  the  heuristic  may  serve  as  a 
powerful  aide  and  guide  to  further  mathematical  analysis. 

Fortunately,  all  three  of  these  criteria  can  be  met.  Tnere  is  a 
way  of  viewing  the  noisy  residue  encoding  process 

x  _ r(x)  ♦  e(x)  (5.1) 

"*  The  author  has  considered  a  significant  number  there  of! 

**  That  is,  we  want  to  be  able  to  realize  any  mapping  from  1  to 


•  .twu-- 


89 


(  J  (' 

that  implies  a  natural  requirement  on  the  vector  x\  Once  ”x  is  suitably 
defined,  definitions  for  all  the  remaining  elements  and  operations  fol¬ 
low. 


(2)  Towards  Defining  x_ 

Think  of  the  scalar  input  x  in  (5.1)  as  representing  the  set  of  all 
possible  incorrect  outcomes  of  the  noisy  encoding  process.  This  point 
of  view  establishes  a  correspondence  between  x  and  a  set  of  residue  vec¬ 
tors: 


x  ^ (r(x)  +  e(x)  for  all  e(x)  of  possible  interest  } .*  (5.2) 

Now  every  r(x)  +  e(x)  will  decode  to  some  number 

Do 

r(x)  +  e(x) - >.  x  +De(e(x))  =  x  +  e(x) 

where  e(x)  =  De(e(x)).  So,  we  might  just  as  well  establish  the 

correspondence  between  x  and  the  set  of  decoded  residue  vectors: 

x  - >»  &(x)  =  {  x  +  e(x)  for  all  e(x)}.**  (5.3) 

For  the  purposes  of  notational  simplicity  the  latter  correspondence  is 
used  in  the  discussion  that  follows. 

As  a  next  step  the  set  5>(x)  in  (4.3)  is  partitioned  into  the  dis¬ 
tinct  sets 


4(x)  =  {  x  }U  (  x  +  e(x)  for  all  e(xWO).  (5.4) 

"*  In  general,  one  presumedly  would  like  a  system  to  be  error 
free.  There  may  be  special  cases,  however,  where  certain  error 
types  are  considered  far  more  costly  or  "disasterous"  than  others. 

In  these  cases  t'ne  primary  concern  may  be  to  eliminate  only  these 
particular  error  types. 

**  From  here  on  the  phrase  "  of  possible  interest"  as  a  qualifier 
on  the  set  will  not  be  used  explicitly.  Unless  stated  otherwise, 
however,  this  qualifier  is  implied. 
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Define  the  set  ?(x)  by 


r 


( 


?(x)  =  {  x  +  e(x)  for  all  e(x)/0). 

The  set  ?(x),  then,  is  the  set  containing  all  the  possible  outcomes  of 
the  noisy  encoding  process,  given  x  is  the  input. 

The  input  domain  1=  {  0,1,2 . ,M-1  }  can  be  partitioned  then  into 

the  2  distinct  sets 

"1  =  ?(x)  u  ?(x)c.» 

So  we  have  a  partition  of  ‘1?  V/hat  purpose  does  it  serve?  We  are  look¬ 
ing  for  a  suitably  well  defined  definition  for  the  vector  7.  If  x  is  to 
"supply"  enough  information  about  the  true  input  such  that  all  errors 
can  be  detected,  then  certainly  it  must  supply  enough  information  to 
tell  us  unambiguously  if  the  output  x  +  e(x)  is  an  element  of  ?(x)  or 
3>(x)c. 

This  error-checking  may  be  accomplished  if  7  itself  induces  a  par¬ 
tition  on  3C ,  the  same  partition  on  T  induced  by  x.  That  is,  x  must  con¬ 
tain  at  least  enough  information  to  induce  the  partition  'f(x)  i j  ?(x)c. 
The  error-checking  can  then  be  done  by  asking 

is  x  +  e(x)c  ?(x)  or  not? 


Definition  for  S. 

The  vector  x  is  the  output  of  the  PreEn  pre-encoding  process. 
It  contains  all  the  information  about  the  scalar  input  x  available 
to  the  system.  It  is  a  kxl  vector  partitioned  into  the  two  subvec- 

*  ?(x)c  means  the  compliment  of  'f(x) . 
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tors  v(x)  and  a(x): 


I  ( 


x  - >>  (  v(x),  a(x)  ) 

where  v(x)  is  a  ixl  vector  and  a(x)  is  a  jxl  vector  and  k=i+j.  The 
scalar  input  x  is  represented  by  the  vector  v(x);  and  the  addi- 
tional  information  about  x  is  pre-encoded  by  the  PreEn  process  to 
the  vector  a(x).  The  pre-encoding  of 

x  - »-  v(  x) 

is  one-to-one  and  onto.  The  use  of  such  coding  allows  for  a  gen¬ 
eral  representation  for  the  input  x.  Examples  of  such  a  general 
representation  are: 


(i) 

X  =  X,+X„+X-+. . ,+x  - 

\  d  S  n 

— >-v(x)  =  (  x1 .x^.x,, . . . , 

i  x  )  and 
n 

(2) 

x  =  x!  x2  x3  ...  xn  - 

- ^-v(x)  =  (  x1 .x^.x^, . . 

.,  xn  ). 

(3)  Towards  Defining  PreEn.  Ejn,  rU) ,  OpEn,  7,  ][(x)  ,  'OpT 

All  of  the  operations  PreFn.  OpEn,  T  and  OpT  satisfy  the  following: 

(i)  the  mappings  nave  vector  inputs  (range)  and  vector  outputs 
( domain) , 

(ii)  the  elements  of  any  vector  are  nonnegative  integers, 

(iii) tne  number  ' °  possible  vector  inputs  and  outputs  is  finite,  and 

(iv)  all  of  the  mappings  can  be  realized  by  polynomial  transformations. 
Definition  for  PreEn 

The  pre-encoding  process  has  been  implicitly  defined  by  the 
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definitions  given  for  the  scalar  input  x  and  the  pre-encoded  output 
vector  5. 

Definition  for  POO  and  EnOO . 

Tne  vector  r(x)  is  the  ouput  of  the  encoder  En : 
x  ...  £{'.>  r ( x) . 

The  vector  r(x)  is  a  nxl  vector.  In  a  specific  case  r(x)  might 
satisfy  the  equality: 

r(x)  =  En(x)  =  En(  v(x),  a(x)  )  =  (  Enl(v(x))t  En2(a(x>) 
where  the  mappings  En1  and  En2  are  independent  operations  acting 
independently  on  the  vectors  v(x)  and  a(x)  as  inputs;  En^O  is  a 
1x1  vector  and  En^C.)  is  a  mxl  vector  where  l+m=n.  In  general, 
however,  the  vector  r(x)  will  satisfy  only  the  equality: 

r ( x)  =  En( x)  . 


Definition  f or  OpEn 

The  error-checking  operation  OpEn  has  as  its  input  the  nxl 
vector  r(x)  and  a  binary  scalar  output.  The  output  is  given  by 

OpEn(r(x))  =  ^  0  if  encoding  inconsistent 
|  1  if  encoding  consistent 


Definition  for  T  and  _t(_x) 

The  operation  T  has  as  its  input  the  nxl  vector  r(x)  and  a  qxl 
output  vector  T(x): 
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fix) 


»  ;t  (x) 
— >■ ; 


;t2(x> 


=  t(  x) 


wn 


ere  (.)  is  oxl  vector,  (.)  is  px  1  vector  and  o+p=k. 
subvector  T^(x)  is  the  residue  vector  for  the  scalar  T(x) 
subvector  T^Cx)  contains  the  additional  information  about  x 


encoded  by  PreEn. 


The 

Tne 

pre- 


Def ir.ition  for  OpT 

The  error-checking  operation  OpT  has  as  its  input  the  kxl  vec¬ 
tor  t(x)  and  a  scalar  binary  output.  The  output  is  given  by 

C)pT(r(x))  =  f  if  the  mapping  T  is  inconsitent 
(  1,  if  the  mapping  T  is  consistent. 


(4 )  Towards  Statement  Of  Research  Methodology 

The  proposed  research  methodology  is  based  primarily  on  the  parti¬ 
tioning  concept  introduced  in  the  subsection  Towards  Defining  x.  Any 
error  detecting/correcting  method,  no  matter  how  derived ,  implicitly 
induces  a  partition  on  the  scalar  input  range  .  Therefore,  obtaining  a 
clear  and  thorough  understanding  of  how  partitions  of  the  finite  range 
can  be  generated  from  (i)  a  collection  of  numbers  (the  elements  of  a(x)) 
and  (ii)  a  set  of  operations  on  these  numbers,  is  a  worthwhile  goal. 

If  we  had  this  understanding,  the  following  questions  could  be 
more  easily  answered. 

(i)  Given  a  set  of  operations  and  a  finite  set  of  elements  what  classes 
of  partitions  can  be  parameter ized  by  the  set  of  elements? 
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(ii)  For  a  given  class  of  partitions  that  can  be  parameter ized  what  tra¬ 
deoffs  exist  between  the  ninber  of  elements  used  and  the  type, 
number  and  order  of  operations  used  to  generate  the  partitions. 

(iii) Is  there  a  minimum  cost  approach  under  some  cost  criterion  where 
the  nunber  of  elements  and  the  type,  number,  and  order  of  opera¬ 
tions  used  to  generate  a  partition  are  the  variables  affecting 
cost . 

(5)  Statement  Of  Research  Methodology 

Our  research  efforts  should  be  directed  towards: 

(i)  acquiring  a  more  thorough  mathematical  background  on  the  nature  of 
partitions  and  how  to  generate  them, 

(ii)  continuing  to  generate  partitions  by  ad  hoc  methods  (see  the  next 
subsection  entitled  Some  Adhoc  Partitioning  Methods) , 

( iii)  developing  a  variety  of  physically  meaningful  cost  criteria  (com¬ 
plexity  measures)  likely  to  suit  most  applications,  and 

(iv)  investigating  the  relationship  between  the  choice  of  a  complexity 
measure  and  the  resultant  cost  of  a  given  partition  method  (a  sen¬ 
sitivity  analysis)  . 

(6)  Some  Adhoc  Partitioning  Methods 

(i)  Number  theoretic  properties  like  evenness/oddness  and  the  nature  of 


the  input  number's  prime  factorization  may  be  useful. 


( 


( 


(ii)  Partitions  induced  by  encoding  the  input  x  mod  some  set  of  small 
moduli  not  subject  to  error  may  prove  to  be  the  best  way.  The  par¬ 
tition  of  induced  by  this  method  given  the  input  x  is  illus¬ 
trated  below: 


0  M-1 

| - . - , - ■ - , - , - 1 

.  .  .  x-2*S  x-S  x  x+S  x+2«S  .  .  . 

where  S  =  the  product  of  the  small  error  free  moduli. 

The  partition  induced  on  by  the  moding  operations  is 
{ x }  U  (  (x+k  S)  modM,  k  =  nonzero  integers  } 

( iii) Partitions  may  be  induced  by  passing  lower  and  upper  bounds  for  the 
input  x,  i.e.,  x£(x-a,x+b). 

(iv)  Combine  the  above  two  methods. 

(v)  Partitions  could  be  generated  by  carrying  out  operations  on  the 
residue  vector.  For  example,  a  set  of  values  {  (r(x),  O^ffx))  J 
i=1,...?  },  where  (...)  is  an  inner  product  and  0^  (.)  is  an  opera¬ 
tion  defined  on  r(x),  could  be  computed. 
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F.  ERROR  CHECKING  BASED  ON  ADDITIONAL  ERROR  FREE  RESIDUE  ENCODING 


(1)  General  Discussion 


We  are  looking  for  ways  to  check  errors  in  residue  encoding.  It  is 
a  natural  question  to  ask  "Can  we  use  residue  encoding  to  check  residue 
encoding?"  The  answer,  in  general,  is  yes.  Simple  examples  of  this 
kind  of  error-checking  are  given  in  Section  D,  All  the  examples  in  this 
section  are  also  of  this  type.  There  is  a  basic  form  to  this  type  of 
error-checking.  The  following  discussion  describes  this  form.  The 
pre-encoding  operation  PreEn  generates  a  residue  vector  p(x): 


p(x)  = 


x  mods1 
x  mods,. 


jx  mod sn j 

where  4=  (s.^  '  i=1,2,...,n}  is  a  set  of  error  free  moduli  (generally 
small),  all  s^  are  relatively  prime,  and  the  range  of  &  is  S.  The 
encoding  operation  Fn  generates  a  residue  vector  r(x): 


x  modm^ 
x  modm.. 


1 x  modm  ' 

!  n! 

where  Tfl  =  (rrr  I  1=1,2,... ,n)  is  a  set  of  noisy  moduli  (generally  large), 
all  m^  are  relatively  prime,  and  the  range  offflis  M.  The  pre-encoded 
residue  vector  p(x)  then  is  used  to  check  for  errors  in  the  residue  vec¬ 
tor  ?(x) . 


For  special  kinds  of  error  processes,  the  error-checki ng  operations 
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based  on  this  method  might  prove  to  be  cost  effective.  The  method  dis¬ 
cussed  below  in  "Example:  Special  case  of  redundant  encoding"  is  a  good 
example  of  such  a  special  case.  It  should  be  pointed  out,  however,  that 
in  order  to  detect  (or  correct)  all  possible  errors  for  any  error  pro¬ 
cess,  the  range  of  the  small  moduli  i  set,  S,  must  be  at  least  as  large 
as  the  range  of  the  encoding  set  Til,  M.  If  S=M,  clearly  one  would  encode 
the  input  x  with  the  error  free  moduli  set  <4  and  not  use  the  error  prone 
set  m  at  all! 

It  is  convenient  to  separate  the  method  of  using  additional  error 
free  moduli  into  two  distinct  types:  (i)  the  range  S  divides  M  and  (ii) 
the  range  S  does  not  divide  M.  These  two-,  distinct  types  are  discussed 
in  the  two  following  subsections.  The  "S  divides  M"  type  is  handled 
first. 

(2)  Discussion  of  "S  divides  M"  Error  Checking 

Example  2  (Section  D)  gives  an  overall  description  of  the  nature, 
behavior,  and  cost  of  such  error-checking  systems.  As  indicated  in 
Example  two,  the  error-checking  method  is  capable  of  detecting  all 
errors  except  those  of  the  form: 

x  +  k<  S,  k  =  1,2,.... 

The  method  can  not  in  general  correct  all  detectable  errors.  However, 
if  the  nature  of  the  error  process  is  of  a  certain  type,  it  is  possible, 
in  principle,  to  detect  and  correct  all  errors.  Example  two  takes  note 
of  this  possibility  for  noisy  mod9  encoding  when  the  error  can  shift  the 
correct  residue  value  by  at  most  1.  This  special  case  uses  only  one 
modulus.  Tnis  can  be  generalized  to  encoding  with  n  noisy  moduli  and 
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error-checking  with  n  error  free  moduli.  The  following  example  does 


just  that. 

Example:  Special  case  of  redundant  encoding 

There  are  three  basic  requirements  that  a  given  noisy  residue 
encoding  process  must  satisfy  for  this  special  case  of  error  correcting 
to  work.  One  requirement  restricts  the  class  of  error  processes  possi¬ 
ble.  The  other  two  requirements  restrict  the  choice  of  moduli  used. 
The  requirements  are: 

(i)  the  error  process  for  each  modrtK  encoding  must  satisfy  the  follow¬ 
ing: 


Inoisy  x  modm^  encoding  -  true  x  modm^  value  I  =  I  error  I  £ 
maximum  residue  shift  from  error  =  maxrs  < 


the  modulus  rt^  for  an  possible  error, 

(ii)  there  must  exist  an  error  free  modulus  s.^  for  each  rr^  that  satis¬ 
fies: 


s^£  2»  maxrs+1 , 

(iii)  and  the  error  free  modulus  s^  must  divide  each  n^,  the  modulus 
that  s^  checks. 

Example  2  (Section  D)  gives  an  example  satisfying  t'ne  above 
requirements .  The  maximum  residue  shift  from  error  is  1.  The  modulus 
=  9.  The  error  checking  modulus  is  s1  =  3  (which  obviously  divides 
9).  The  cost  of  correcting  errors  is: 
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Cost  of  correcting  errors  =  cost(x  mod3  encoding) 

♦  cost'’  OpEn  operation  with  3x9  =  27  DOF). 

A  generalization  of  this  error  correction  procedure  is  straight 
forward.  For  every  m^ein  =  {nr  such  that  i=1,2,...,n}  =  "noisy  encoding 
set"  there  must  be  a  corresponding  s^  =  { !  i=1,2,...,n)  =  "error¬ 
checking  encoding  set"  such  that  the  three  above  requirements  are  met. 
The  error-checking  operation  OpEn  used  to  correct  the  noisy  residue 
values  x  modnK  can  be  decomposed  into  n  independent  operations.  There 
exists  a  one-to-one  ocrrespondence  between  the  "ith  residue  pair" 
dc-fined  by 

ith  residue  pair  A  (xmodn^  +  error,  xmods^  ) 
and  the  "ith  independent  error-checking  operation." 

This  decomposition  can  be  done  because  each  modulus  jik  (or  s^  supplies 
information  about  the  input  value  x  independent  of  all  the  otner  moduli 
in  the  set  7TK  or  <J).  Tne  moduli  pairs  (rrr.s^)  are  redundant  however, 
and  therefore  x  mods^  supplies  information  about  x  that  is  not  indepen¬ 
dent  of  the  information  supplied  by  the  x  modnr  value. 

Tne  ith  independent  error  correction  operation  in  OpEn  requires 

s^  rrK  DOF. 

Therefore  the  cost  of  the  ith  operation  is: 

Cost  of  correction  x  modnv  error  =  cost(x  modsi  encoding) 

+  cost( operation  with  s  m  ^  DOF). 

The  total  cost  of  error  correcting  then  is: 


Cost  of  error  correcting  r(x)  =  cost(encode  x  with  d) 

n 

♦  2  cost(  operation  with  s.*m.  DOF). 

i=1  1 

The  following  example  illustrates  the  essentials  of  this  method. 
Example 

Suppose  encoding  is  done  with  the  set  IT)  =  (121,169).  If  the  max¬ 
imum  residue  shift  from  error  is  5,  then  the  set  d=  (11,13)  could  be 
used  to  check  for  all  errors  in  residue  encoding  with  modulus  set 
The  set  &  could  also  be  used  to  check  for  all  errors  in  a  general  T 
operation  if  the  maximum  residue  shift  remains  5.  The  cost  of  error 
correcting  is: 

Cost  of  error  correcting  =  costCencode  x  with  d) 

+  cost( operation  with  11x121  DOF) 

+  cost( operation  with  13x169  DOF). 

The  nature  of  error  processes  intrinsic  to  analog  encoding  of  residues 
often  satisfy  the  requirements  above.  Therefore  this  special  case  of 
error  correcting  should  prove  to  be  useful  in  situations  where  analog 
processes  are  used  to  do  mod  encoding  and  residue  arithmetic  in  general. 

(3)  Discussion  Of  "S  does  not  divide  M"  Error  Checking 

Example  one  (  Section  D)  is  a  special  case  of  this  type  of  error¬ 
checking.  In  the  example,  x  mod2  encoding  is  used  to  detect  all  odd 
shift  errors.  One  might  nope  that  detecting  even  shift  errors  is  just 
as  simple.  Unfortunately,  it  is  not.  If  some  small  odd  modulus  s  is 
used  to  check  for  even  shift  errors,  it  can  not  detect  even  shift  errors 
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of  the  form: 


+  k-2-s,  k=1 ,2 . 

where  x  is  the  input  scalar  and  "5  is  some  small  odd  modulus  assumed  to 
be  error  free. 

As  an  example,  consider  Table  III. 2  in  Example  two  (Section  D) . 
Suppose  the  input  were  0.  If  the  small  odd  modulus  were  3,  the  value  x 
mod3  could  not  detect  the  error 

x  mod9  +  error  =0+6=0+  2x3 

At  best,  error-checking  methods  based  on  an  error  free  set  of  moduli 
&  =  (2,  and  other  relatively  prime  odd  moduli)  can  detect  all  errors 

except  those  of  the  form 

x  +■  k-S,  k=1 ,2, 3, . . .  (6.1) 

where  S  is  the  range  of  £.  In  general  these  methods  can  not  correct  all 
detectable  errors. 

The  cost  of  error-checking  is: 

Cost  of  error-checking  =  cost(encode  with  A  moduli  set) 

+  cost(0pEn  operation  with  S*M  DOF).  (6.2) 


(4)  Final  Discussion 

Special  structure  in  the  error  process  intrinsic  to  the  mod  encod¬ 
ing  (or  residue  arithmetic  operation)  can  lead  to  error-checking  methods 
capable  of  detecting  and  even  correcting  all  errors.  The  method 
described  in  "Example:  Special  case  of  redundant  encoding"  is  a  non¬ 
trivial  example.  In  general,  however,  error-checking  based  on 
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G.  SOME  MORE  ERROR  CHECKING  METHODS 

( 1 )  General  Discussion 

The  discussion  of  the  three  main  topics  in  this  section  is  for  the 
most  part  heuristic.  First,  methods  of  error-checking  other  than  the 
use  of  additional  error  free  moduli  are  discussed.  Second,  methods 
using  the  pre-encoding: 

x  - v(  x) 

as  defined  in  the  "Definition  for  x"  (Section  E)  are  briefly  covered. 
Lastly,  a  way  to  perform  the  residue  arithmetic  operations 

x  modmi  - T(x)  modrn.,  i  =  l,2 . n  (7.1) 

using  a  set  of  small  moduli  whose  range  is  >_  nr  for  all  i=1,2 . n  is 

discussed . 

(2)  Error-Checking  Without  Additional  Error-Free  Residue  Encoding 

The  partitioning  methods  given  in  (i),  (iii),  (iv),  and  (v)  in  (6) 
Some  Adhoc  Partitioning  Methods  (Section  E)  are  examples  of  such 
methods.  Is  it  possible  that  these  methods  can  do  a  substantially 
better  job  of  error-checking  than  methods  using  additional  error  free 
residue  encoding  for  the  same  cost?  This  appears  doubtful.  The  follow¬ 
ing  discussion  indicates  why  this  is  so. 

A  given  error-checki ng  operation  for  some  residue  arithmetic  opera¬ 
tion  T, 

r(  x) - 1 — >■  t(  x) 
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must  generate  nunbers  other  than  residue  values  from  the  input  (x), 
r(x),  and  the  output  t(x).  The  numbers  are  then  compared.  If  there  are 
M  possible  inputs  (and  outputs)  and  the  number  of  possible  errors  for 
any  given  input  is  N  then  any  error-checking  operation  has,  in  general, 

M*N  DOF. 

However,  there  appear  to  be  two  reasons  why  such  methods  may  prove 
cost  effective  in  some  situations: 

(i)  error-checking  operations  which  generate  partitions  different  from 
those  that  can  be  easily  generated  with  residue  encoding  may  be 
necessary  and 

(ii)  ordinary  scalar  arithmetic  operations  on  residue  x  modm.  vaiues  may 
prove  to  be  easier  (less  costly)  to  perform  than  residue  encoding. 


(3)  Alternate  x  Input  Representations 


In  "Definition  for  x"  (  Section  E)  mention  is  made  of  pre-encoding 
the  scalar  input  x  to  some  alternate  representation: 


x  >  (x1 ,x2,x3, . . . ,xn)where  x  =  x1  x?  x^  ...  xp. 

Why  do  this  at  all?  Suppose  the  input  domain  1=  10,1,2 . M-1), 

Then  it  might  be  possible  to  use  a  set  of  x..  such  that  each  xi  satisfies 

*i  <  vM  ■ 

If  each  element  of  v(x)  is  residue  encoded  separately,  then  the  range  of 
the  residue  encoding  process  is  reduced  to  only 


This  reduction  in  the  range  necessary  to  encode  x  is  significant 
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and  certainly  would  help  make  systems  designed  to  perform  arithmetic 
operations  based  on  a  residue  number  system  less  susceptible  to  encoding 
error.  Unfortunately,  using  the  v(x)  representation  for  x  does  destroy 
the  nice  structure  for  arithmetic  operations  inherent  in  a  residue 
number  system  with  the  full  range  M.  Hence,  the  cost  of  "doing  busi¬ 
ness"  ,  i.e.,  the  cost  of  performing  an  actual  arithmetic  operation  T 
will  substantially  increase. 

(4 )  Small  Residues  To  Do  Large  Residue  Arithmetic  Operations 

For  some  residue  arithmetic  operation  T  each  residue  x  modrrK  is 
mapped  to: 

x  modrrr  - »-  T(x)  modnu,  i  =  1,2 . n.  (7.1) 

But  what  is  the  mapping  in  (7.1)?  It  is  a  mapping  from  the  finite  set 
10, 1 ,2, . . . ,m  -1 }  for  all  i=1,2,...,n  to  itself.  Therefore  residue 
arithmetic  could  be  used  to  accomplish  it! 

Suppose  we  have  a  moduli  set  C=  (qi  !  i=1,2 . k)  where  the  range 

of  C  is  0  and  Q_>  rrn  for  all  i=1,2,...,n.  Then  this  one  set  of  small 
moduli  could  used  to  compute  all  the  mappings  in  (7.1)! 

The  cost  of  doing  each  residue  arithmetic  operation  directly  in 
(7.1)  is: 

Cost  of  it'n  residue  operation  =  cost(operation  with  riK  DOF) 

+  cost(error  with  m.). 

l 

The  cost  of  doing  each  residue  operation  with  the  set  Q  is: 

Cost  of  it'n  residue  operation  =[  cost(encode  x  modrK  with  0)  ] 
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♦  2  cost(operation  with  q.  DOF) 

i=1  1 

♦  cost(error  with  Q) 

+  [  cost(Decode  to  T(x)  modrrr)  3. 

I 

The  cost  of  encoding  and  decoding  with  Q  are  in  brackets  because  it  is 
possible  to  design  a  system  that  performs  many  different  T  mappings  such 
that  the  encoding  with  Q  is  done  only  once  before  all  the  T  mappings  and 
the  decoding  is  done  only  once  after  all  the  T  operations. 

If  it  is  intrinsically  easier  to  generate  x  modm.  residue  encoding 
for  large  rru  than  it  is  to  generate  the  arbitrary  T  mappings  in  (7.1), 
this  method  may  prove  particularly  cost  effective. 
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H.  SUMMARY 


A  general  system  concept  able  to  incorporate  all  residue  arithmetic 
operations  and  error  correcting/detecting  methods  as  special  cases  is 
developed.  The  block  diagram  representation  in  Fig.  4.3  depicts  this 
system . 

A  methodology  for  research  into  the  intrinsic  capabilities  of  error 
correcting/detecting  methods  is  proposed.  The  motivation,  main  develop¬ 
ment,  and  statement  of  this  methodology  is  given  in  Section  E. 

Several  simple  examples  of  error  correcting/detecting  methods  are 
given.  In  particular,  the  method  explained  in  "Example:  Special  case  of 
redundant  coding"  might  prove  to  be  cost  effective  when  analog  processes 
involving  thresholding  are  used  to  perform  residue  arithmetic  opera¬ 
tions. 

The  topic  of  system  complexity  and  resultant  cost  is  an  important 
one.  The  need  to  develop  physically  meaningful  measures  of  system  com¬ 
plexity  is  noted  in  (5)  Statement  of  Research  Methodology  (Section  E). 
The  costs  of  various  error-checking  methods  are  discussed  in  the  exam¬ 
ples  given.  Upon  comparing  the  form  of  the  costs  as  given,  it  is  clear 
that  more  quantitative  statements  need  to  be  made  in  order  to  chose  with 
confidence  between  some  of  the  different  methods. 
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APPENDIX 


ADDRESS 

CONT 

ADDRESS 

CONT 

ADDRESS 

000000 

xooo 

010101 

xxxx 

101010 

000001 

X001 

010110 

xxxx 

101011 

UuUOlu 

X010 

010111 

xxxx 

101100 

OOOoll 

XUll 

OlluUJ 

X011 

101101 

0001U0 

X100 

OlloOl 

X10U 

101110 

000101 

xxxx 

OllOiO 

xooo 

lullll 

000110 

xxxx 

011011 

XOOI 

1 1 000  0 

000111 

xxxx 

011100 

X010 

1100O1 

OOluOO 

xooi 

011101 

XXXX 

110010 

001001 

X010 

011110 

XXXX 

110011 

001010 

X011 

011111 

XXXX 

110100 

001011 

X100 

lOOuGO 

X100 

110101 

001100 

xooo 

100001 

xooo 

110110 

001101 

xxxx 

100010 

XOOI 

110111 

001110 

xxxx 

100011 

X010 

111000 

001111 

xxxx 

100100 

X011 

111001 

010000 

X010 

100101 

xxxx 

111010 

010001 

X011 

100110 

xxxx 

111011 

010010 

X100 

loom 

xxxx 

111100 

010011 

xooo 

101000 

xxxx 

111101 

010100 

XOOI 

101001 

xxxx 

1111L0 

mill 

TABLE  T1 


CONT 

xxxx 

xxxx 

xxxx 

xxxx 

xxxx 

xxxx 

xxxx 

xxxx 

xxxx 

xxxx 

xxxx 

xxxx 

xxxx 

xxxx 

xxxx 

xxxx 

xxxx 

xxxx 

xxxx 

xxxx 

xxxx 

xxxx 
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address 

cout 

OOOOOO 

xooo 

000001 

XU01 

GGuOlO 

X010 

000011 

X011 

000100 

X100 

oooioi 

X101 

000110 

X110 

000111 

xxxx 

00100O 

XOOi 

001001 

XGlu 

U01O10 

X011 

OOlOll 

X100 

001100 

X101 

001101 

xnn 

001110 

xooo 

001111 

xxxx 

010000 

X010 

010001 

X011 

010010 

X100 

010011 

X101 

010100 

X110 

TABLE  T2 


ADDRESS 

CONT 

000000 

xooo 

UOOOOl 

xooi 

Ouuolu 

XGlu 

000011 

X011 

0U0100 

X100 

000101 

X101 

000110 

X110 

000111 

Xlll 

uOlOOO 

XOOI 

O01001 

X010 

001010 

X011 

OOlOll 

xioo 

001100 

X101 

U011J1 

X110 

001110 

Xlll 

001111 

XOOO 

010000 

X010 

010001 

X011 

010010 

xioo 

010011 

X101 

010100 

X110 

010101 

xooo 

010110 

XOOI 

010111 

xxxx 

OliUOO 

X011 

011001 

xioo 

011010 

X101 

011011 

X110 

011100 

xooo 

011101 

XOOI 

011110 

X010 

011111 

xxxx 

100000 

XIOO 

100001 

X101 

100010 

X110 

100011 

xooo 

100100 

XOOI 

100101 

X010 

100110 

X011 

loom 

xxxx 

101000 

Xiol 

101001 

X110 

ADDRESS 

CONT 

010101 

Xlll 

010110 

xooo 

010111 

XOUl 

011000 

X011 

011001 

XlOu 

011010 

Xlul 

ullull 

Xllu 

011100 

Xlll 

011101 

XOOO 

011110 

XOOI 

011111 

X010 

1000UU 

XIOO 

lOOUUl 

X101 

100010 

X110 

100011 

Xlll 

100100 

xooo 

100101 

xooi 

100110 

X010 

100111 

X011 

101000 

X101 

101001 

X110 

add(  js 

COMT 

101010 

XOOO 

101011 

XOOI 

101100 

X010 

101101 

XUll 

101110 

XI  DO 

101111 

XXXX 

110000 

Xllu 

110001 

xooo 

110010 

XOOI 

lluoil 

X010 

110100 

X011 

110101 

XIOO 

110110 

X101 

110111 

XXXX 

111000 

xxxx 

111001 

xxxx 

111010 

xxxx 

111011 

xxxx 

111100 

xxxx 

llllul 

xxxx 

111110 

xxxx 

mill 

xxxx 

ADDRESS 

CONT 

101010 

Xlll 

101011 

XOOo 

101100 

XOOI 

101101 

xuio 

101110 

X011 

lullll 

XluQ 

110000 

xllu 

110001 

xlll 

110010 

xooo 

110011 

XOOI 

110100 

X010 

110101 

X011 

110110 

xioo 

110111 

X101 

111000 

Xlll 

111001 

XOOO 

111010 

XOOI 

1} 1011 

X010 

111100 

X011 

111101 

XIOO 

111110 

X101 

111111 

X110 

TABLE  T3 
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ADDRESS 

CCi 

ADDRESS 

COHT 

/k  *-j  0  R  E  u 

COMT 

OOOUOO 

xuoo 

010101 

xxxx 

101010 

xxxx 

OOOOOl 

XOOl 

010110 

xxxx 

101011 

xxxx 

ooauio 

X010 

010111 

xxxx 

101100 

xxxx 

000011 

X011 

OlluOO 

xxxx 

101101 

xxxx 

000100 

XIOO 

011001 

xxxx 

101110 

xxxx 

U00101 

X101 

011010 

xxxx 

101111 

xxxx 

000110 

X110 

U11011 

xxxx 

110000 

xxxx 

000111 

Xlll 

011100 

xxxx 

110001 

xxxx 

00100a 

xxxx 

011101 

xxxx 

110010 

xxxx 

001001 

xxxx 

Ollliu 

xxxx 

110011 

xxxx 

uOIOIU 

xxxx 

oillll 

xxxx 

110100 

xxxx 

001011 

xxxx 

100000 

xxxx 

110101 

xxxx 

001100 

xxxx 

100001 

xxxx 

110110 

xxxx 

001101 

xxxx 

100010 

xxxx 

110111 

xxxx 

001110 

xxxx 

100011 

xxxx 

111000 

xxxx 

001111 

xxxx 

100100 

xxxx 

111001 

xxxx 

010000 

xxxx 

100101 

xxxx 

111010 

xxxx 

010001 

xxxx 

100110 

xxxx 

111011 

xxxx 

010010 

xxxx 

loom 

xxxx 

111100 

xxxx 

010011 

xxxx 

101000 

xxxx 

111101 

xxxx 

010100  xxxx 

TABLE  T4 

101001 

xxxx 

111110 

111111 

xxxx 

xxxx 

ADDRESS 

CONT 

ADDRESS 

CONT 

ADDRESS 

r> 

o 

OuOOOU 

xoou 

010101 

XOOl 

101010 

X011 

UUOUOl 

XOOl 

010110 

XOIO 

101011 

XIOO 

000010 

XOIO 

010111 

XUll 

101100 

XOOO 

000011 

X011 

OlluOO 

XIOO 

101101 

XOOl 

000100 

XIOO 

011001 

XOOO 

101110 

XOIO 

000101 

XUOO 

011010 

XOOl 

101111 

X011 

000110 

XOOl 

011011 

XOIO 

110000 

XIOO 

000111 

XOlo 

011100 

X011 

11U001 

XOOO 

OOlUOU 

X011 

011101 

XIOO 

11U010 

XOOl 

001001 

XIOO 

ullllu 

XOOO 

110011 

XOIO 

uoioio 

XOOO 

Oilin' 

XOOl 

110100 

X011 

001011 

xuui 

100000 

X011 

110101 

XIOO 

001100 

XOIO 

10U001 

XIOO 

110110 

XOOO 

001101 

X011 

100010 

XOOO 

110111 

XOOl 

001110 

X1U0 

100011 

XOOl 

111000 

XOIO 

001111 

XOOO 

100100 

XOIO 

111001 

X011 

010G0U 

XU01 

100101 

X011 

111010 

XIOO 

010(101 

XOIO 

100110 

XIOO 

111011 

XOOO 

010010 

X011 

loom 

XUOO 

111100 

XOOl 

01U011 

XIOO 

101000 

XOOl 

111101 

XOIO 

010100 

XOOO 

101001 

XOIO 

111110 

X011 

linn  xioo 
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TABLE  T5 


ADDRESS 

COWT^- 

ADDRESS 

CONT 

ADDRESS 

CO  N  T 

OOUOdG 

XOUO 

010101 

xooo 

10  1 0  1 0 

X11G 

OOOUOl 

XOOI 

01C110 

XOU 1 

J01011 

xooo 

OGUOIO 

X010 

010111 

X010 

luilou 

XOOI 

OOOOll 

X011 

011000 

X01 1 

101101 

X010 

000100 

X100 

011001 

X100 

101110 

X011 

000101 

X101 

011010 

X101 

101111 

X10U 

UOOllU 

X110 

011011 

X110 

110000 

X101 

000111 

xooo 

011100 

xooo 

liuuoi 

X110 

001000 

X001 

011101 

XOOi 

110010 

XOOO 

001001 

X010 

011110 

X010 

110011 

XOOI 

001010 

X011 

011111 

X011 

110100 

X010 

001011 

X100 

100000 

X011 

110101 

X011 

001100 

X101 

100001 

X100 

11G11U 

X100 

001101 

X110 

100010 

X101 

110111 

X101 

001110 

xooo 

100U11 

X110 

111000 

X110 

001111 

XOOI 

100100 

XOOO 

111001 

XOOO 

010000 

X010 

100101 

XOOI 

111010 

XOOI 

010001 

X011 

100110 

X010 

111011 

XUlO 

010010 

X100 

loom 

X011 

111100 

X011 

010011 

X101 

101000 

X100 

111101 

X100 

010100 

X110 

101001 

X101 

111110 

min 

X101 

X110 

TABLE  T6 


ADDRESS 

CONT 

ADDRESS 

CONT 

ADDRESS 

CONT 

OO0000 

xooo 

010101 

X101 

10101U 

X010 

OOOUOl 

XOOI 

0101 lu 

X110 

101011 

X011 

000010 

X010 

010111 

Xlll 

101100 

X100 

OOOOll 

X011 

G110UU 

XOOO 

101101 

X101 

000100 

X100 

011001 

XOOI 

101110 

X110 

000101 

X101 

011010 

X010 

101111 

Xlll 

000110 

X110 

011011 

XUll 

110000 

xooo 

000111 

Xlll 

011100 

X100 

110G01 

XOOI 

001000 

xooo 

011101 

X101 

11U010 

X010 

001001 

XOOI 

011110 

X110 

11 001 1 

X011 

OOlulO 

X010 

011111 

Xlll 

110100 

XlOO 

001011 

X011 

100000 

xooo 

110101 

X101 

001100 

X100 

100031 

XOOI 

110110 

X110 

001101 

X101 

lOUOlO 

X010 

110111 

Xlll 

001110 

X110 

100011 

X011 

111000 

xooo 

001111 

Xlll 

100100 

XI 00 

111U01 

XOOi 

010000 

xooo 

100101 

X101 

111010 

XOlU 

010001 

XOOI 

1001 10 

X110 

111011 

X011 

010U10 

X010 

1 00 i 1 1 

Xlll 

111100 

XlOO 

U 1 00 1 1 

X011 

1010UU 

xooo 

111101 

X101 

010100 

X 1 00 

101001 

XOOI 

111110 

linn 

XI  lu 
Xlll 

TABLE  T7 
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ADDRESS  CONI 

UUGOOO  XOOO 
OOuOOl  XOll 
UOUOIO  X110 
UOOOll  XOIO 
OOOIOO  X101 
000101  XU01 
000110  X100 

000111  xxxx 

001000  X100 

001001  xooo 

001010  XOll 
001J11  Xlio 
001100  XOIO 
001101  X101 

001110  X001 

001111  XXXX 
OlOOOO  X001 
010001  X100 

010010  xooo 

010011  XOll 
010100  X110 


TABLE  T8 


ADDRESS  CONT 

000000  xooo 

000001  X101 

OOUUlO  XOIO 

oooon  xm 

OOOIOO  X100 
000101  X001 

OOUllU  Xlio 

uoom  xoii 

OOIOOO  XOll 
001001  XOOO 
001010  X101 

001011  XOIO 
OOlluO  Xlll 
001101  X100 

001110  X001 

001111  Xlio 
OlOOOO  Xlio 
010001  XOll 
010010  XOOO 
010011  X101 

010100  XOIO 


ADDRESS  CONT 

OlUlOl  XOIO 
Oluilu  X101 
010111  xxxx 
OllOuO  X101 
01 1 001  XUU1 
011010  X100 

011011  xooo 

011100  XOll 
011101  X110 

011110  XOIO 
011111  xxxx 
100000  XOIO 
100001  X101 

10U01U  X001 

100011  X100 

100100  xooo 

100101  XOll 

100110  Xlio 
loom  xxxx 
101000  xxxx 
101001  xxxx 


ADDRESS  CONT 

010101  Xlll 

010110  X100 

010111  X001 

011000  X001 

011001  X110 

011010  XOll 

011011  xooo 

011100  X101 

011101  XOIO 
011110  Xlll 
011111  Xl&o 
100000  X100 

100001  X001 

1U0010  Xlio 
100011  XOll 

100100  xooo 

100101  X101 

100110  XOIO 

loom  xiii 
101000  xxxx 
101001  xxxx 


ADDRESS  CONT 

101010  xxxx 
101011  xxxx 
lulioo  xxxx 
101101  xxxx 
10111U  xxxx 
loim  xxxx 
110U0Q  XXXX 
110001  xxxx 
110010  xxxx 
llooil  xxxx 
110100  xxxx 
110101  xxxx 
110110  xxxx 
110111  xxxx 
uiouo  xxxx 
111001  xxxx 
111010  xxxx 
men  xxxx 
111100  xxxx 
111101  xxxx 
111110  xxxx 
linn  xxxx 


address  cont 

101010  xxxx 
101011  xxxx 
101100  xxxx 
101101  xxxx 
101110  xxxx 
101111  xxxx 
11U00U  xxxx 
llOuOl  xxxx 
licoiu  xxxx 
110011  xxxx 
110100  xxxx 
110101  xxxx 
none  xxxx 
110111  xxxx 
111000  xxxx 
111U01  xxxx 
111010  xxxx 
111011  xxxx 
111100  xxxx 
111101  xxxx 
11)110  xxxx 
linn  xxxx 


ADDRESS 

CONT 

ADDRESS 

CONT 

ADDRESS 

o 

p 

OuOOOO 

xooo 

01G1U1 

xiui 

101010 

XUll 

OOOOOl 

Xlll 

010110 

XIOl) 

101011 

XG1C 

000010 

X110 

010111 

X011 

101100 

X001 

000011 

X101 

011000 

X011 

101101 

XuOO 

000100 

X100 

011001 

X010 

101110 

Xlll 

OOUlOl 

X011 

011010 

X001 

101111 

X110 

00011U 

X010 

011011 

XuOO 

110000 

X110 

000111 

X001 

ulllGU 

Xlll 

110001 

XIOl 

001000 

X001 

011101 

X11G 

110010 

X100 

001001 

XOOO 

011110 

XIOl 

110011 

X011 

001010 

Xlll 

011111 

X100 

110100 

X010 

001011 

X110 

100000 

X100 

110101 

X0U1 

0U110G 

X101 

100001 

X011 

110110 

xooo 

OOllUl 

X100 

100010 

X010 

110111 

Xlll 

001110 

X011 

100011 

X001 

111000 

xxxx 

001111 

X010 

100100 

xooo 

111001 

xxxx 

010000 

X010 

100101 

Xlll 

111010 

xxxx 

010001 

X001 

100110 

X110 

111011 

xxxx 

010010 

xooo 

loom 

XIOl 

111100 

xxxx 

010011 

Xlll 

101000 

XIOl 

111101 

xxxx 

010100 

X110 

101001 

X100 

111110 

xxxx 

111111 

xxxx 

TABLE  T10 

ADDRESS 

CONT 

ADDRESS 

CONT 

ADDRESS 

CONT 

000000 

UOOU 

010101 

xxxx 

101010 

xxxx 

OUOOOl 

0001 

010110 

xxxx 

101011 

xxxx 

000010 

U010 

010111 

xxxx 

101100 

xxxx 

U00011 

0011 

011000 

xxxx 

101 1 G 1 

xxxx 

00  U1 00 

0100 

011001 

xxxx 

101110 

xxxx 

000101 

xxxx 

011010 

xxxx 

1  u  1 1 1 1 

xxxx 

000110 

xxxx 

011011 

xxxx 

11U000 

xxxx 

OOolll 

xxxx 

01 1 1  GO 

xxxx 

110001 

xxxx 

OOIOOO 

xxxx 

011101 

xxxx 

110010 

xxxx 

001001 

xxxx 

011110 

xxxx 

110011 

xxxx 

001010 

xxxx 

011111 

xxxx 

110100 

xxxx 

001011 

xxxx 

lOOusjO 

xxxx 

110101 

xxxx 

001100 

xxxx 

100G01 

xxxx 

110110 

xxxx 

001101 

xxxx 

100010 

xxxx 

110111 

xxxx 

001110 

xxxx 

10UU11 

xxxx 

111000 

xxxx 

001111 

xxxx 

100100 

xxxx 

111001 

xxxx 

010000 

xxxx 

100101 

xxxx 

111010 

xxxx 

010001 

xxxx 

100110 

xxxx 

111011 

xxxx 

010010 

xxxx 

100111 

xxxx 

111100 

xxxx 

010011 

xxxx 

lOloOO 

xxxx 

111  101 

xxxx 

010100 

xxxx 

101001 

xxxx 

illlio 

xxxx 

111111 

xxxx 

- 
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TABLE  TU 


ADDRESS  CONT 


ADDRESS  CONT 


ADDRESS  CONT 


GOOOOJ  0000 
000001  0000 
000010  0000 
000011  0000 
000100  1111 
U00101  1111 
000110  1111 
000111  1111 
001000  0000 
001001  0000 
001010  0000 
001011  0000 
001100  1111 
001101  1111 
001110  1111 
001111  1111 
010000  0000 
010001  0000 
010010  0000 
010011  0000 
010100  1111 


TABLE  T12 
ADDRESS  CONT 

000000  oooo 
000001  0010 
000010  0100 
000011  0110 
000100  0111 
000101  1001 
000110  1011 
0G0111  1101 

UOIOOO  0000 
001001  0010 
001010  0100 
001011  Olio 
001100  0111 
001101  1001 
001110  1011 
001111  1110 
010000  0000 
010001  0010 
010010  0101 
010011  0111 
OlOlUO  0111 


010101  1111 
010110  1111 
Olulll  1111 
011000  0000 
011001  oooo 
OllUlO  oooo 
011011  oooo 
omao  mi 

011101  nil 
011110  1111 
011111  1111 
100000  oooo 
100001  oooo 
100010  oooo 
100011  oooo 
1001 00  1111 
100101  1111 
100110  1111 
loom  mi 
101000  oooo 

101G01  0000 


ADDRESS  CONT 

010101  1010 
010110  1100 
uioin  mo 

011000  OOOU 
011001  0011 
011010  0101 
OllUll  0111 
011100  1000 
011101  lulu 
011110  1100 
011111  1110 
100000  0001 
100001  0011 
100010  0101 
1U0011  0111 

100100  luoo 
100101  1010 
lOOilU  1100 

loom  mi 
101000  0001 
101001  0011 


101010  U0U0 
101011  000 J 

101100  mi 
101101  1111 
101110  1111 
101111  1111 
IIUUOU  oooo 

110001  oooo 
110010  oooo 
110011  oooo 
110100  1111 
110101  1111 
110110  1111 
110111  1111 
111000  xxxx 
111001  xxxx 
111010  xxxx 
moil  xxxx 
111100  xxxx 
111101  xxxx 
111110  xxxx 
111111  xxxx 


ADDRESS  CONT 

101010  0101 
101011  1000 
101100  1000 
101101  1011 
101110  1101 
101111  1111 
110000  0001 
110001  0100 
110010  0110 
110011  1000 
110100  1001 
110101  1011 
110110  1101 
110111  1111 
111000  xxxx 
111001  xxxx 
111010  xxxx 
moil  xxxx 
111100  xxxx 
111101  xxxx 
111110  xxxx 
111111  xxxx 


TABLE  T13 
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C  1  c 


ADDRESS 

CONT 

ADDRESS 

CONT 

ADDRESS 

000000 

0000 

010101 

0001 

101010 

U00001 

0011 

OlullO 

OlOu 

101011 

000010 

0110 

010111 

0111 

101100 

000011 

1001 

GllCOu 

1111 

101101 

000100 

0100 

ClluGl 

0010 

101110 

JGC101 

0111 

011010 

0101 

101111 

000110 

1010 

011011 

1000 

110000 

000111 

1101 

0111U0 

0011 

110001 

001000 

0101 

011101 

0110 

11001G 

001001 

1000 

011110 

1001 

110011 

001010 

1011 

011111 

1100 

110100 

001011 

1110 

100000 

0100 

110101 

001100 

1001 

lOOGul 

0111 

110110 

001101 

1100 

100010 

1010 

110111 

001110 

1111 

100011 

1101 

111000 

001111 

0010 

100100 

1000 

111001 

010000 

1010 

100101 

1011 

111010 

010001 

1101 

100110 

1110 

111011 

010010 

0000 

loom 

0001 

111100 

010011 

0011 

101 UU  0 

1001 

111101 

010100 

1110 

101001 

1100 

111110 

mill 
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ADDRESS 

CONT 

ADDRESS 

CONT 

ADDRESS 

000000 

0000 

010101 

0011 

101010 

000001 

0001 

010110 

0100 

101011 

000010 

0010 

OlOlli 

Olul 

101100 

000011 

0011 

011000 

0011 

101101 

000100 

0001 

011001 

0100 

101110 

000101 

0010 

011010 

0101 

101111 

000110 

0011 

011011 

0110 

110000 

000111 

0100 

011100 

0100 

110001 

001000 

0010 

011101 

0101 

110010 

001001 

0011 

011110 

0110 

110011 

001010 

0100 

011111 

0111 

110100 

001011 

0101 

100000 

0010 

110101 

001100 

0011 

100001 

0011 

110110 

001101 

0100 

100010 

0100 

110111 

001110 

0101 

100011 

0101 

llluoo 

001111 

0110 

100100 

0011 

111001 

010000 

0001 

100101 

0100 

111010 

010001 

0010 

100110 

0101 

111011 

010010 

0011 

loom 

0110 

111100 

O10011 

0100 

101000 

0100 

111101 

010100 

0010 

101001 

0101 

111110 

min 

CON  T 

1111 

0010 

1101 

ooou 

0011 

0110 

1110 

0001 

0100 

0111 

0010 

0101 

1000 

1011 

xxxx 

xxxx 

xxxx 

xxxx 

xxxx 

xxxx 

xxxx 

xxxx 


CONT 

0110 

0111 

0101 

0110 

0111 

1000 

0011 

0100 

0101 

0110 

0100 

0101 

0110 

0111 

0101 

0110 

0111 

1000 

0110 

0111 

1000 

1001 
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